
Commit

add int8
echarlaix committed Oct 4, 2023
1 parent 09c2e00 commit 0654305
Showing 1 changed file with 7 additions and 0 deletions.
README.md: 7 additions & 0 deletions
@@ -75,6 +75,13 @@ It is possible to export your model to the [OpenVINO](https://docs.openvino.ai/2
```plain
optimum-cli export openvino --model distilbert-base-uncased-finetuned-sst-2-english ov_distilbert
```

To apply int8 quantization to your model weights while keeping the activations in floating-point precision, add `--int8`:

```plain
optimum-cli export openvino --model distilbert-base-uncased-finetuned-sst-2-english --int8 ov_distilbert
```

#### Inference:

To load a model and run inference with OpenVINO Runtime, you can just replace your `AutoModelForXxx` class with the corresponding `OVModelForXxx` class.
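
For example, here is a minimal sketch for the model exported above (it assumes the export commands above were run, so the `ov_distilbert` directory exists locally):

```python
from transformers import AutoTokenizer, pipeline
from optimum.intel import OVModelForSequenceClassification

# Load the OpenVINO model exported by `optimum-cli export openvino` above
model = OVModelForSequenceClassification.from_pretrained("ov_distilbert")
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased-finetuned-sst-2-english")

# The loaded model is a drop-in replacement in the usual transformers pipeline
classifier = pipeline("text-classification", model=model, tokenizer=tokenizer)
print(classifier("OpenVINO is fast!"))
```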
