How to handle FC layers with non-constant inputs? #11941
jinevening started this conversation in General
Let's discuss how to handle FC layers with non-const inputs.
## Current status

The `--replace_non_const_fc_with_batch_matmul` pass converts FC with non-const weights (regardless of whether ifm is const or not) into batch matmul. This is for ease of quantization (circle quantizer does not support FC with non-const weights).

```
ifm (const) -----+--> FC --> ofm
wgt (non-const)--+
```

is converted to

```
ifm (const) -----+--> BMM --> ofm
wgt (non-const)--+
```

In this way, ifm is quantized per-tensor, because it is treated as an activation tensor.
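For concreteness, here is a minimal NumPy sketch of what the rewrite computes, assuming the TFLite/circle FullyConnected convention (weights stored as `[out_dim, in_dim]`, ofm = ifm · wgtᵀ) and a BatchMatMul that multiplies its inputs directly; the shapes and names are illustrative only:

```python
import numpy as np

rng = np.random.default_rng(0)
N, K, M = 2, 4, 3              # batch, input depth, output depth

ifm = rng.normal(size=(N, K))  # const input (the case discussed here)
wgt = rng.normal(size=(M, K))  # non-const weights, [out, in] layout

fc_ofm = ifm @ wgt.T           # FC(ifm, wgt)

# After the pass: the same product expressed as a batch matmul; the weight
# transpose is absorbed by the rewrite (e.g. an inserted Transpose or the
# matmul's adjoint attribute -- an assumption about the pass's mechanics).
wgt_t = wgt.T
bmm_ofm = ifm @ wgt_t          # BMM(ifm, wgt_t)

assert np.allclose(fc_ofm, bmm_ofm)
```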
## Alternative

We can do as follows.

1. Change `--replace_non_const_fc_with_batch_matmul` to do the conversion only when ifm is non-const. -> This will allow FC with const ifm and non-const weights.
2. Introduce a new pass (ex: `--replace_non_const_fc_with_transposed_fc`) which does the below conversion for FC with const ifm and non-const weights.

```
ifm (const) -----+--> FC --> ofm
wgt (non-const)--+
```

is converted to

```
wgt (non-const) --> Transpose --+--> FC --> Transpose --> ofm
ifm (const) --> Transpose ------+
```
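To see why the rewritten graph computes the same ofm, note that it is just the matrix identity (A·B)ᵀ = Bᵀ·Aᵀ. A minimal NumPy check, simplifying FC to a plain product ofm = ifm · wgt (the exact Transpose placement depends on circle's FC weight layout):

```python
import numpy as np

rng = np.random.default_rng(0)
N, K, M = 2, 4, 3

ifm = rng.normal(size=(N, K))       # const
wgt = rng.normal(size=(K, M))       # non-const

ofm = ifm @ wgt                     # original FC (simplified)

# Rewritten graph: transpose both operands, feed them to FC in swapped
# positions, and transpose the result back: (ifm @ wgt).T == wgt.T @ ifm.T
ofm_rewritten = (wgt.T @ ifm.T).T   # Transpose(FC(Transpose(wgt), Transpose(ifm)))

assert np.allclose(ofm, ofm_rewritten)
```

After the rewrite, Transpose(ifm) occupies the FC weights position, which is what makes per-channel quantization of ifm possible.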
This will allow per-channel quantization of ifm, but Transpose Ops are added. So, the alternative can give better accuracy at the cost of a potential performance penalty (the extra Transpose Ops).
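Why per-channel granularity matters here: if the const ifm has channels with very different magnitudes, a single per-tensor scale wastes most of the int8 range on the small channels. A hedged sketch (symmetric int8 quantization; the data and the quantization axis are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
# A const ifm whose rows (channels) differ in magnitude by ~1000x overall.
ifm = rng.normal(size=(4, 64)) * np.array([[100.0], [10.0], [1.0], [0.1]])

def fake_quantize(x, scale):
    """Symmetric int8 quantize, then dequantize back to float."""
    q = np.clip(np.round(x / scale), -127, 127)
    return q * scale

# Per-tensor: one scale shared by the whole tensor.
per_tensor = fake_quantize(ifm, np.abs(ifm).max() / 127.0)

# Per-channel: one scale per row (the quantization axis is illustrative).
scales = np.abs(ifm).max(axis=1, keepdims=True) / 127.0
per_channel = fake_quantize(ifm, scales)

print("per-tensor  mean |err| per row:", np.abs(ifm - per_tensor).mean(axis=1))
print("per-channel mean |err| per row:", np.abs(ifm - per_channel).mean(axis=1))
# The small-magnitude rows are reconstructed far more accurately with
# per-channel scales; with one per-tensor scale they collapse toward zero.
```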
If the Transpose Ops can be canceled out by adjacent operators, this idea seems cool.
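A small sketch of the cancellation condition: two back-to-back Transpose Ops can be folded away exactly when the composition of their permutations is the identity (the helper names below are hypothetical, not an existing pass):

```python
import numpy as np

def compose(inner, outer):
    """Permutation computed by x.transpose(inner).transpose(outer)."""
    return [inner[i] for i in outer]

def can_cancel(inner, outer):
    """True when the two Transposes fold to the identity."""
    composed = compose(inner, outer)
    return composed == list(range(len(composed)))

assert can_cancel([1, 0], [1, 0])            # a 2-D transpose pair cancels
assert not can_cancel([0, 2, 1], [1, 0, 2])  # these two do not

# Numeric sanity check.
x = np.arange(6).reshape(2, 3)
assert np.array_equal(x.transpose([1, 0]).transpose([1, 0]), x)
```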
The alternative was proposed by @parjong
CC @parjong @ejjeong
---

**Reply:**

After offline discussion, we decided to do the first item of the alternative. The second item may be discussed later, if it can bring a clear benefit.