[CPU] enable int8_dynamic_activation_int4_weight with Int4CPULayout #2128

Xia-Weiwen · 2025-04-25T10:22:16Z

Summary
Enable quantizing a model with int8_dynamic_activation_int4_weight and Int4CPULayout.

Test plan

pytest test/quantization/test_quant_api.py -k test_8da4w_cpu

pytorch-bot · 2025-04-25T10:22:19Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2128

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 35ece3b with merge base e3db2b2:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jerryzh168 · 2025-04-28T23:48:00Z

test/quantization/test_quant_api.py

+                torch.compile(m, fullgraph=True, dynamic=True),
+                *example_inputs,
+            )
+            assert "_weight_int4pack_mm_for_cpu" in code[0]


I remember this op is weight only quant?

Xia-Weiwen · 2025-04-29

Yes. This op is used here because we don't have an op to compute da8w4 on CPU yet, so it falls back to explicit dequantization followed by a call to this op.
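For readers following the thread, the fallback described in this reply can be sketched numerically as quantize, dequantize, then a plain floating-point matmul. This is only a rough illustration in plain Python, not torchao code: every function name below is hypothetical, symmetric scaling is assumed for both activations and weights, and the weight is dequantized explicitly here even though in the real path the packed int4 weight is handed to the weight-only op.

```python
# Hypothetical reference for a da8w4 fallback: int8 per-token activations,
# int4 group-wise weights, both dequantized before a float matmul.
# Symmetric scaling is an assumption made for brevity.

def quantize_symmetric(values, qmin, qmax):
    """Return (integer codes, scale) for one group of floats."""
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    codes = [max(qmin, min(qmax, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to floats."""
    return [c * scale for c in codes]


def fake_quant_int8_per_token(row):
    """Dynamic activation quantization: one scale per token (row)."""
    codes, scale = quantize_symmetric(row, -128, 127)
    return dequantize(codes, scale)


def fake_quant_int4_groupwise(row, group_size):
    """Weight quantization: one scale per group of input features."""
    out = []
    for start in range(0, len(row), group_size):
        codes, scale = quantize_symmetric(row[start:start + group_size], -8, 7)
        out.extend(dequantize(codes, scale))
    return out


def linear_da8w4_fallback(x, weight, group_size):
    """x: [tokens][in_features]; weight: [out_features][in_features]."""
    x_dq = [fake_quant_int8_per_token(row) for row in x]
    w_dq = [fake_quant_int4_groupwise(row, group_size) for row in weight]
    return [[sum(a * b for a, b in zip(xr, wr)) for wr in w_dq] for xr in x_dq]


# Small usage example comparing against the unquantized result.
x = [[1.0, -2.0, 0.5, 0.25]]
weight = [[0.5, -0.25, 1.0, 0.75], [-1.0, 0.5, 0.25, -0.5]]
y = linear_da8w4_fallback(x, weight, group_size=2)
ref = [[sum(a * b for a, b in zip(xr, wr)) for wr in weight] for xr in x]
```

The quantized output `y` should stay close to `ref`, with the gap dominated by the 4-bit weight rounding. A fused da8w4 kernel would instead keep the activations in int8 through the matmul rather than dequantizing them first.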

[CPU] enable int8_dynamic_activation_int4_weight with Int4CPULayout

0581451

facebook-github-bot added the CLA Signed label Apr 25, 2025

Merge branch 'main' into da8w4_with_int4_cpu_layout

dffbbab

Xia-Weiwen added cpu quantize topic: new feature labels Apr 25, 2025

Xia-Weiwen added 2 commits April 25, 2025 03:27

Fix format issue

9fb7f77

Merge branch 'main' into da8w4_with_int4_cpu_layout

35ece3b

Xia-Weiwen requested a review from leslie-fang-intel April 28, 2025 11:02

jerryzh168 reviewed Apr 28, 2025


leslie-fang-intel approved these changes Apr 29, 2025


Xia-Weiwen marked this pull request as ready for review April 29, 2025 02:01

Xia-Weiwen requested a review from jerryzh168 April 29, 2025 03:16
