Slide View : 15-418/618 Spring 2014

Parallel Programming Case Studies

Previous | Next --- Slide 43 of 63

Back to Lecture Thumbnails

sbly

I'm confused. Wouldn't we just scan each of the arrays separately in parallel? This is real easy to do if it's stored as a two-dimensional array. Why do we use this "head-flag" representation?

This comment was marked helpful 0 times.

kayvonf

@sbly: Thought experiment: What if array 1 was one million elements, and array two was 10 elements.

This comment was marked helpful 0 times.

devs

Could someone elaborate one Kayvon's thought experiment? The work being done on array1 is much more costly than on array 2, but there's no way getting around that. Is it just that the head flag indicates where the partition is being made which can then later be used to call scan on each of the arrays separately?

This comment was marked helpful 0 times.

kayvonf

I interpretted @sbly's suggestion as: if we have N subarrays, just parallelize over the subarrays (yielding N independent pieces of work). I presume that each sort would be carried out sequentially.

So in the case of N=2, imagine if one of those subarrays was huge, and the other tiny. What problems do you see with the parallelization scheme I just described?

This comment was marked helpful 0 times.

vrkrishn

The problem with the given parallelization scheme is that simply splitting up the tasks into N different tasks does not account for workload imbalance, and that the performance gains that we can gain through parallelism would be diminished.

This is the same problem that we were presented with in the Prog 1 Mandlebrot of Asst. 1.

This comment was marked helpful 0 times.

eatnow

Is it correct to say that this approach is useful only because we have the "head-flag" representation? Suppose we have a list of lists but do not know where they are in the original data, this will not work? I.e. we have a[] and maybe data[], but do not have flag[].

This comment was marked helpful 0 times.