Optimize load distribution between nodes #719

vishwamartur · 2025-02-20T16:56:03Z

Related to #701

Implement load distribution between nodes based on flops and memory.

Partitioning Strategy:
- Add flops attribute to Partition class in exo/topology/partitioning_strategy.py.
- Update map_partitions_to_shards function to consider flops when mapping partitions to shards.
- Add get_flops method to PartitioningStrategy class to calculate the flops of each partition.
Ring Memory Weighted Partitioning Strategy:
- Update partition method in exo/topology/ring_memory_weighted_partitioning_strategy.py to consider both memory and flops for partitioning.
- Add calculate_flops_weight helper function to calculate the weight of each node based on its flops.
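One way the memory-and-flops weighting described above could work is sketched below. This is an illustration under stated assumptions, not the code in this PR: `NodeCaps`, `combined_weights`, and the `memory_share` blending parameter are hypothetical names, and the `Partition` dataclass is a simplified stand-in for the one in exo/topology/partitioning_strategy.py.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class NodeCaps:
    """Hypothetical stand-in for the capabilities a node reports."""
    node_id: str
    memory: float  # any consistent unit, e.g. MB
    flops: float   # any consistent unit, e.g. TFLOPS


@dataclass
class Partition:
    """Simplified stand-in: the fraction [start, end) of the model a node owns."""
    node_id: str
    start: float
    end: float


def combined_weights(nodes: List[NodeCaps], memory_share: float = 0.5) -> List[float]:
    """Blend each node's share of total memory and total flops into one weight.

    memory_share is an assumed tuning knob: 1.0 means memory only, 0.0 means flops only.
    """
    total_memory = sum(n.memory for n in nodes)
    total_flops = sum(n.flops for n in nodes)
    if total_memory <= 0 or total_flops <= 0:
        raise ValueError("total memory and total flops must both be positive")
    return [
        memory_share * (n.memory / total_memory)
        + (1.0 - memory_share) * (n.flops / total_flops)
        for n in nodes
    ]


def partition(nodes: List[NodeCaps], memory_share: float = 0.5) -> List[Partition]:
    """Assign contiguous fractions of the model in proportion to the blended weights."""
    # Sort deterministically so every node derives the same ring order.
    ordered = sorted(nodes, key=lambda n: (n.memory, n.node_id), reverse=True)
    weights = combined_weights(ordered, memory_share)
    partitions = []
    start = 0.0
    for i, (node, weight) in enumerate(zip(ordered, weights)):
        # Pin the last boundary to 1.0 so rounding never leaves a gap at the end.
        end = 1.0 if i == len(ordered) - 1 else round(start + weight, 5)
        partitions.append(Partition(node.node_id, start, end))
        start = end
    return partitions
```

With `memory_share=1.0` the sketch reduces to a purely memory-weighted split, so the blending factor makes it easy to compare the two behaviours on the same topology.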
Node Class:
- Update Node class in exo/orchestration/node.py to sort nodes by flops when distributing load.
- Add sort_nodes_by_flops method to sort nodes by their flops.
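The sorting step could look roughly like the sketch below. The PR text does not show the method body, so the input shape (a mapping from node id to a flops figure) and the tie-breaking rule are assumptions made for illustration; in the PR the logic is described as a method on Node, while this sketch uses a free function to stay self-contained.

```python
from typing import Dict, List, Tuple


def sort_nodes_by_flops(flops_by_node: Dict[str, float]) -> List[Tuple[str, float]]:
    """Return (node_id, flops) pairs, highest flops first.

    Ties are broken by node id (an assumed rule) so the order is deterministic
    regardless of which node computes it.
    """
    return sorted(flops_by_node.items(), key=lambda item: (-item[1], item[0]))


ranked = sort_nodes_by_flops({"b": 10.0, "a": 10.0, "c": 25.5})
# ranked == [("c", 25.5), ("a", 10.0), ("b", 10.0)]
```

A deterministic tie-break matters if each node sorts its own view of the topology independently, since otherwise two nodes with equal flops could appear in different positions on different peers.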
Inference Engine:
- Add get_flops method to InferenceEngine class in exo/inference/inference_engine.py to get the flops of the current node.
Sharded Inference Engine:
- Add get_flops method to MLXDynamicShardInferenceEngine class in exo/inference/mlx/sharded_inference_engine.py to get the flops of the current node.

