
MUSA: Use Monkey Patching to Automatically Convert CUDA Backend to MUSA #583

Status: Open. yeahdongcn wants to merge 1 commit into main from the musa-py branch.
Conversation

@yeahdongcn (Contributor) commented on Feb 21, 2025:

This PR introduces monkey patches to automatically convert the CUDA backend to the MUSA backend.

Key Updates

  1. util.torch_auto_backend.py
    • Added global variables CUDA and CUDA0 to replace the hardcoded "cuda" and "cuda:0" strings.
    • Implemented monkey patching to automatically convert the CUDA backend to MUSA (see the sketch after this list).
    • Added test cases for MUSA.
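
A minimal sketch of what such a patch can look like is shown below. This is illustrative only, not the PR's actual code: it assumes torch_musa mirrors the torch.cuda API under the torch.musa namespace, and the specific set of patched entry points is an assumption.

```python
# Illustrative sketch only (not the PR's code). Assumes torch_musa mirrors the
# torch.cuda API under torch.musa; the patched entry points below are examples.
import torch

CUDA, CUDA0 = "cuda", "cuda:0"  # fall back to the stock CUDA backend

try:
    import torch_musa  # registers the "musa" device type and the torch.musa namespace

    if torch.musa.is_available():
        CUDA, CUDA0 = "musa", "musa:0"
        # Redirect commonly used torch.cuda entry points to their MUSA counterparts,
        # so existing torch.cuda.* calls transparently target the MUSA backend.
        torch.cuda.is_available = torch.musa.is_available
        torch.cuda.device_count = torch.musa.device_count
        torch.cuda.current_device = torch.musa.current_device
        torch.cuda.set_device = torch.musa.set_device
        torch.cuda.synchronize = torch.musa.synchronize
        torch.cuda.empty_cache = torch.musa.empty_cache
except ImportError:
    pass  # torch_musa not installed: keep "cuda" / "cuda:0"

print(f"Torch backend loaded: CUDA={CUDA}, CUDA0={CUDA0}")
```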

Testing Done

  • make dev_install
  • python ./tests/torch_auto_backend_test.py, which produced the output below (a sketch of this kind of check follows this list):
    Torch backend loaded: CUDA=musa, CUDA0=musa:0
    musa musa:0
    tensor([1.2000, 2.3000], device='musa:0')
    tensor([1.2000, 2.3000], device='musa:0')
    True
    8
    <torch_musa.core.device.Device object at 0x7ff626246320>
    tensor([1.2000, 2.3000], device='musa:0')
    0
  • Ran inference with the following command (the output tokens still have some issues):
    numactl -N 1 -m 1 python ./ktransformers/local_chat.py --cpu_infer 33 \
      --model_path deepseek-ai/DeepSeek-R1 \
      --gguf_path /models/hub/models--unsloth--DeepSeek-R1-GGUF/snapshots/02bcc0a0f68146dae57942804d82bdf0cc636003/DeepSeek-R1-Q4_K_M \
      --optimize_rule_path /ws/ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat.yaml
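
For reference, here is a hedged sketch of the kind of check the test script could perform, consistent with the output shown above. The import path ktransformers.util.torch_auto_backend and the exact calls are assumptions, not taken from the PR.

```python
# Hypothetical smoke test (not the PR's test file). Assumes the patched module
# exposes CUDA / CUDA0 and that torch_musa provides the torch.musa namespace.
import torch
import torch_musa  # noqa: F401  (registers the "musa" device type)

from ktransformers.util.torch_auto_backend import CUDA, CUDA0  # assumed import path

print(f"Torch backend loaded: CUDA={CUDA}, CUDA0={CUDA0}")  # e.g. CUDA=musa, CUDA0=musa:0

x = torch.tensor([1.2, 2.3], device=CUDA0)  # lands on musa:0 when MUSA is active
print(x)

print(torch.musa.is_available())    # True on MUSA hardware
print(torch.musa.device_count())    # e.g. 8
print(torch.musa.current_device())  # e.g. 0
```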

Review thread on the following diff:

- generate_device: str = "cuda",
- prefill_device: str = "cuda",
+ generate_device: str = CUDA,
+ prefill_device: str = CUDA,
Contributor commented:

prefill_device and generate_device do not need to be changed.
Change them in your custom YAML file instead. The same applies elsewhere.

@yeahdongcn (Contributor, author) replied on Feb 22, 2025:

I just wanted to ensure consistency throughout the code; otherwise both CUDA and "cuda" will be present.
Would you like me to revert this change?

@Atream (Contributor) commented on Feb 22, 2025:

Thank you for your contribution.
We will merge this after planning and testing a unified architecture that is compatible with various GPUs. Until then, please work from your own branch, and remember to merge the main branch frequently to stay in sync with us and get better performance.

yeahdongcn force-pushed the musa-py branch 2 times, most recently from 89435df to 4792b81 on February 25, 2025.