
Commit

Update docs/source/openvino/export.mdx
Co-authored-by: Helena Kloosterman <[email protected]>
nikita-savelyevv and helena-intel authored Dec 16, 2024
1 parent 4028332 commit 40ee6cd
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/source/openvino/export.mdx
@@ -78,7 +78,7 @@ Optional arguments:
--ratio RATIO A parameter used when applying 4-bit quantization to control the ratio between 4-bit and 8-bit
quantization. If set to 0.8, 80% of the layers will be quantized to int4 while 20% will be
quantized to int8. This helps to achieve better accuracy at the sacrifice of the model size
- and inference latency. Default value is 1.0. Note: If dataset is provided, and the ration is
+ and inference latency. Default value is 1.0. Note: If dataset is provided, and the ratio is
less than 1.0, then data-aware mixed precision assignment will be applied.
--sym Whether to apply symmetric quantization
--group-size GROUP_SIZE
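The arithmetic behind `--ratio` can be illustrated with a small Python sketch. The helper `split_by_ratio` below is hypothetical and is not part of any library; it assumes the simplest reading of the help text, a count-based split of layers in list order. The actual tool chooses which layers stay in int8 by its own criteria (for example, the data-aware assignment mentioned in the note), so treat this only as a way to read the numbers.

```python
def split_by_ratio(layer_names, ratio):
    """Hypothetical helper: split layers into int4 and int8 groups by count.

    Assumption: a plain count-based split in list order. The real
    selection logic is decided by the quantization library.
    """
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must be between 0.0 and 1.0")
    n_int4 = round(len(layer_names) * ratio)
    return layer_names[:n_int4], layer_names[n_int4:]


# Ten illustrative layer names (made up for this example).
layers = [f"layer_{i}" for i in range(10)]

# ratio=0.8: 80% of the layers go to int4, the remaining 20% to int8.
int4_layers, int8_layers = split_by_ratio(layers, 0.8)
print(len(int4_layers), len(int8_layers))

# ratio=1.0 (the documented default): every layer goes to int4.
all_int4, none_int8 = split_by_ratio(layers, 1.0)
print(len(all_int4), len(none_int8))
```

With a ratio below 1.0, some layers remain in int8, which is the trade described in the help text: better accuracy in exchange for a larger model and higher latency.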