[FX] Support weight quantization for operations where `weight_port_id` != 1 #3334

siddhant-0707 · 2025-03-08T02:18:54Z

Changes

Updated the FX backend's `_get_input_scale_shape` to compute the per-channel scale shape from the FX insertion point shape and, when available, the actual weight tensor's shape.
Adjusted the statistics collector in `_get_stat_collector` so that the reduction and aggregation axes are derived from the same channel axes that are used for the scale shape computation.
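
To illustrate the relationship between channel axes, scale shape, and reduction axes described above, here is a minimal sketch in plain Python. The helper names `per_channel_scale_shape` and `reduction_axes_for` are hypothetical and are not the functions changed in this PR; the weight shapes follow the PyTorch layout, where `Conv2d` stores output channels on axis 0 and `ConvTranspose2d` stores them on axis 1.

```python
from typing import Iterable, Sequence, Tuple


def per_channel_scale_shape(weight_shape: Sequence[int], channel_axes: Iterable[int]) -> Tuple[int, ...]:
    """Keep the size of each channel axis and collapse every other axis to 1.

    Hypothetical helper for illustration only.
    """
    ndim = len(weight_shape)
    keep = {axis % ndim for axis in channel_axes}  # normalize negative axes
    return tuple(dim if i in keep else 1 for i, dim in enumerate(weight_shape))


def reduction_axes_for(ndim: int, channel_axes: Iterable[int]) -> Tuple[int, ...]:
    """Return every axis that is not a channel axis.

    Statistics are reduced over these axes, so the result broadcasts
    against the scale shape computed from the same channel axes.
    """
    keep = {axis % ndim for axis in channel_axes}
    return tuple(i for i in range(ndim) if i not in keep)


# Conv2d weight: (out_channels, in_channels, kH, kW), channel axis 0.
conv_weight_shape = (16, 8, 3, 3)
print(per_channel_scale_shape(conv_weight_shape, [0]))  # (16, 1, 1, 1)
print(reduction_axes_for(len(conv_weight_shape), [0]))  # (1, 2, 3)

# ConvTranspose2d weight: (in_channels, out_channels, kH, kW), channel axis 1.
conv_transpose_weight_shape = (8, 16, 3, 3)
print(per_channel_scale_shape(conv_transpose_weight_shape, [1]))  # (1, 16, 1, 1)
print(reduction_axes_for(len(conv_transpose_weight_shape), [1]))  # (0, 2, 3)
```

Deriving both values from one set of channel axes keeps the collected statistics and the scale tensor shape-compatible, which is the consistency the second change aims for.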

Related tickets

Issue #3206

Tests

All tests pass except one:
`test_smooth_quant.py::TestTorchSQAlgorithm::test_smooth_quant_algo[LinearMultiShapeModel-reference_values0]` fails with `RuntimeError: shape '[1, 1, 2]' is invalid for input of size 1`.
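
For context on the error message: a reshape is only valid when the element count of the input equals the product of the target dimensions. The sketch below checks that condition in plain Python; the helper `reshape_is_valid` is hypothetical, and which tensor in the failing test has a single element is not established here.

```python
from math import prod
from typing import Sequence


def reshape_is_valid(numel: int, target_shape: Sequence[int]) -> bool:
    """A reshape succeeds only if the element counts match (hypothetical helper)."""
    return numel == prod(target_shape)


# The failing case: one element cannot fill a [1, 1, 2] shape (2 elements).
print(reshape_is_valid(1, [1, 1, 2]))  # False
# A two-element input would fit the same target shape.
print(reshape_is_valid(2, [1, 1, 2]))  # True
```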

siddhant-0707 added 2 commits March 7, 2025 20:59

Support weight channel axes

b8203a5

Change minmax algo to support channel axes for ConvTranspose

474e6b7

siddhant-0707 requested a review from a team as a code owner March 8, 2025 02:18

github-actions bot added the NNCF PTQ label Mar 8, 2025

siddhant-0707 added 3 commits March 7, 2025 21:36

add comment back algorithm.py

7319f9f

add comment algorithm.py

3308b40

refactor parameter names in

0ee4f8b
