Skip to content

Commit

Permalink
Merge branch 'ns/update-ratio-description' of github.com:nikita-savel…
Browse files Browse the repository at this point in the history
…yevv/optimum-intel into ns/update-ratio-description
  • Loading branch information
nikita-savelyevv committed Dec 17, 2024
2 parents 6c79118 + 40ee6cd commit ecbad58
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/source/openvino/export.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -78,7 +78,7 @@ Optional arguments:
--ratio RATIO A parameter used when applying 4-bit quantization to control the ratio between 4-bit and 8-bit
quantization. If set to 0.8, 80% of the layers will be quantized to int4 while 20% will be
quantized to int8. This helps to achieve better accuracy at the sacrifice of the model size
and inference latency. Default value is 1.0. Note: If dataset is provided, and the ration is
and inference latency. Default value is 1.0. Note: If dataset is provided, and the ratio is
less than 1.0, then data-aware mixed precision assignment will be applied.
--sym Whether to apply symmetric quantization
--group-size GROUP_SIZE
Expand Down

0 comments on commit ecbad58

Please sign in to comment.