Array Partitioning #174

steiltre · 2023-07-18T04:36:16Z

@youd3 brought an imbalance in the array partitioning to my attention. In the current implementation, if P processes store P+1 items in a ygm::container::array, roughly the first P/2 ranks each get 2 items, leaving almost half of the ranks completely empty. This occurs because we use a global block size of ceil(k/P) when storing k items on P processes (2 in this example), and we assign full blocks to ranks in order until we run out of data.
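To make the imbalance concrete, here is a minimal sketch of the block scheme described above. The function names (`block_size`, `items_on_rank`, `count_empty_ranks`) are hypothetical and chosen for illustration; this is not YGM's actual code, only the arithmetic as I understand it from the description.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helpers illustrating the scheme described in this issue.
// They are not part of YGM.

// Global block size ceil(k / P) for k items on P ranks.
std::size_t block_size(std::size_t k, std::size_t P) {
  assert(P > 0);
  return (k + P - 1) / P;
}

// Number of items a rank holds when full blocks are assigned in rank order
// until the data runs out.
std::size_t items_on_rank(std::size_t k, std::size_t P, std::size_t rank) {
  const std::size_t b = block_size(k, P);
  const std::size_t begin = rank * b;  // first global index owned by rank
  if (b == 0 || begin >= k) {
    return 0;  // no data left for this rank
  }
  const std::size_t remaining = k - begin;
  return remaining < b ? remaining : b;
}

// Count ranks that end up holding nothing.
std::size_t count_empty_ranks(std::size_t k, std::size_t P) {
  std::size_t empty = 0;
  for (std::size_t rank = 0; rank < P; ++rank) {
    if (items_on_rank(k, P, rank) == 0) {
      ++empty;
    }
  }
  return empty;
}
```

For example, with P = 8 and k = 9 the block size is 2, so ranks 0 through 3 hold 2 items each, rank 4 holds 1, and ranks 5 through 7 hold nothing.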

This imbalance becomes less severe as the average number of items per rank increases. I believe the fraction of completely empty ranks is bounded above by 1 - floor(k/P)/ceil(k/P) in the worst case. At large scales, this is enough to realistically leave entire compute nodes empty when k/P is relatively small (in the hundreds or possibly thousands).
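One way to check the bound, as a sketch: the symbols b and q below are shorthand introduced here, and the argument assumes blocks are assigned in rank order as described above.

```latex
Let $b = \lceil k/P \rceil$ and $q = \lfloor k/P \rfloor$, with $k \ge 1$.
Since $k \ge qP$, the number of nonempty ranks satisfies
\[
  \left\lceil \frac{k}{b} \right\rceil \;\ge\; \frac{k}{b} \;\ge\; \frac{qP}{b},
\]
so the fraction of empty ranks satisfies
\[
  1 - \frac{\lceil k/b \rceil}{P} \;\le\; 1 - \frac{q}{b}
  \;=\; 1 - \frac{\lfloor k/P \rfloor}{\lceil k/P \rceil}.
\]
```

With P = 8 and k = 9, the empty fraction is 3/8 and the bound is 1 - 1/2 = 1/2. When P divides k, the bound is 0 and no rank is empty.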

The fix is relatively simple: use two different chunk sizes for the data, plus a little more math to determine which rank owns a particular index. To maintain consistency, the bag.rebalance() method needs to be updated whenever the array partitioning changes.
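A minimal sketch of what a two-chunk-size scheme could look like, assuming the larger chunks go to the lowest ranks. The names (`balanced_partition`, `make_partition`, `local_size`, `owner`) are hypothetical, and this is not necessarily how the fix is implemented in YGM.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical layout: the first (k % P) ranks hold floor(k/P) + 1 items,
// the remaining ranks hold floor(k/P) items. Not taken from YGM.
struct balanced_partition {
  std::size_t small_size;   // floor(k / P)
  std::size_t num_large;    // number of ranks holding small_size + 1 items
  std::size_t large_total;  // items covered by the large chunks
};

balanced_partition make_partition(std::size_t k, std::size_t P) {
  assert(P > 0);
  const std::size_t small_size = k / P;
  const std::size_t num_large = k % P;
  return {small_size, num_large, num_large * (small_size + 1)};
}

// Number of items stored on a given rank.
std::size_t local_size(const balanced_partition& p, std::size_t rank) {
  return rank < p.num_large ? p.small_size + 1 : p.small_size;
}

// Rank that owns a global index (index must be less than k).
std::size_t owner(const balanced_partition& p, std::size_t index) {
  if (index < p.large_total) {
    return index / (p.small_size + 1);  // inside the large chunks
  }
  // Only reachable when small_size > 0; otherwise every valid index
  // falls inside the large chunks.
  assert(p.small_size > 0);
  return p.num_large + (index - p.large_total) / p.small_size;
}
```

With P = 8 and k = 9, rank 0 holds 2 items and every other rank holds 1, so no rank is empty and local sizes differ by at most one.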

steiltre · 2023-11-28T23:06:10Z

Addressed in PR #187.

steiltre mentioned this issue Nov 27, 2023

Bugfix/array partitioning #187

Merged

steiltre closed this as completed Nov 28, 2023
