
Added quantization for OUTETTS #2662

Merged: 33 commits into openvinotoolkit:latest on Feb 18, 2025

Conversation

@nikita-malininn (Contributor) commented Jan 16, 2025

  • Added quantization via nncf.quantize to the notebook (a sketch of the call is shown below, after the ticket reference)
  • Quantization uses the mixed preset and the transformer model type, since the model architecture is an LLM
  • Quantization uses an ignored scope because of OpenVINO issues with the optimized SDPA inference
  • Quality validation of the quantized model is possible only through expert listening, given the nature of the task
  • Performance validation is possible only through the generate pipeline, due to the model architecture

Ticket: 157133
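
A minimal sketch of how the resulting call could look, assembled from the bullet points above and the code excerpts later in this thread; hf_model, libritts, transform_fn, and interface are assumed to be defined in the notebook's earlier cells:

```python
from functools import partial

import nncf

# Calibration dataset built from LibriTTS samples.
dataset = nncf.Dataset(libritts, partial(transform_fn, interface=interface))

quantized_model = nncf.quantize(
    hf_model,
    dataset,
    preset=nncf.QuantizationPreset.MIXED,   # mixed preset for the LLM backbone
    model_type=nncf.ModelType.TRANSFORMER,  # transformer-specific quantization rules
    ignored_scope=nncf.IgnoredScope(
        # Keep SDPA nodes unquantized to work around the OpenVINO issue
        # with optimized SDPA inference mentioned above.
        patterns=[
            "__module.model.layers.*.self_attn/aten::scaled_dot_product_attention/ScaledDotProductAttention"
        ],
    ),
)
```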


@nikita-malininn nikita-malininn marked this pull request as ready for review January 16, 2025 13:04
@nikita-malininn nikita-malininn marked this pull request as draft January 16, 2025 13:12
@nikita-malininn nikita-malininn marked this pull request as ready for review January 27, 2025 11:38
@MaximProshin (Contributor)

@KodiaqQ , what results do you get with the quantized model vs original on your machine?

@nikita-malininn (Contributor, Author) commented Jan 28, 2025

> @KodiaqQ , what results do you get with the quantized model vs original on your machine?

FP model generate time: 3.926095366012305
INT model generate time: 2.8104791679652408

Update: times recalculated with the ignored scope applied.
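
For context, a minimal sketch of how such generate-pipeline timings could be collected; interface.generate and its text argument are assumptions about the OuteTTS API surface, not code from the notebook:

```python
import time

def measure_generate_time(interface, text: str, n_runs: int = 3) -> float:
    """Average wall-clock time of one full TTS generation."""
    timings = []
    for _ in range(n_runs):
        start = time.perf_counter()
        interface.generate(text=text)  # assumed OuteTTS generation entry point
        timings.append(time.perf_counter() - start)
    return sum(timings) / len(timings)

# Hypothetical usage comparing the original and quantized models:
# fp_time = measure_generate_time(fp_interface, "Hello world")
# int_time = measure_generate_time(int8_interface, "Hello world")
```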

@nikita-malininn nikita-malininn marked this pull request as draft January 28, 2025 10:30
@nikita-malininn nikita-malininn marked this pull request as ready for review February 3, 2025 14:12
@nikita-malininn nikita-malininn marked this pull request as draft February 6, 2025 13:10
"hf_model = OVHFModel(model_dir, device.value).model\n",
"dataset = nncf.Dataset(libritts, partial(transform_fn, interface=interface))\n",
"\n",
"quantized_model = nncf.quantize(\n",
Contributor

I would suggest using INT4 weight compression with dynamic quantization (A8W4). @KodiaqQ claims that the performance of such a model is equal to the performance of the quantized model, while the compression rate is higher for the A8W4 model. (A sketch of that setup follows below.)

cc @MaximProshin
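
For reference, a hedged sketch of what the suggested A8W4 setup could look like; the ratio and group_size values are illustrative placeholders, and the dynamic-quantization property reflects my reading of the OpenVINO runtime option rather than code from this PR:

```python
import nncf
import openvino as ov

# W4: compress the weights of the OpenVINO model to INT4.
compressed_model = nncf.compress_weights(
    hf_model,
    mode=nncf.CompressWeightsMode.INT4_ASYM,
    ratio=1.0,      # placeholder: fraction of layers compressed to INT4
    group_size=64,  # placeholder: quantization group size
)

# A8: enable dynamic 8-bit activation quantization at compile time.
core = ov.Core()
compiled_model = core.compile_model(
    compressed_model,
    device_name="CPU",
    config={"DYNAMIC_QUANTIZATION_GROUP_SIZE": "32"},
)
```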

Contributor

Please share the numbers for both cases. If INT4 is better, I'm OK with using that method.

Contributor

We discussed it offline and agreed to keep INT8. In the meantime, the SDPA issue will be analyzed in #16177; if it is resolved, we will update the notebook afterwards.

@nikita-malininn nikita-malininn marked this pull request as ready for review February 10, 2025 12:10
@nikita-malininn (Contributor, Author)

@l-bat, can you review, please? Thanks.

@@ -22,6 +22,11 @@
"- [Run model inference](#Run-model-inference)\n",
@l-bat (Collaborator) commented Feb 10, 2025

Line #36: "__module.model.layers.*.self_attn/aten::scaled_dot_product_attention/ScaledDotProductAttention"

Could you please provide a reason in the description for why we should add this pattern to IgnoredScope?


@nikita-malininn (Contributor, Author)

Done.

@@ -22,6 +22,11 @@
"- [Run model inference](#Run-model-inference)\n",
@l-bat (Collaborator) commented Feb 10, 2025

Line #3: demo = make_demo(interface)

I think it would be nice to use the quantized model in the interactive demo. Could you please add an option for the user to choose between the original and optimized models?
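
A hedged sketch of one way such an option could look, assuming Gradio is the notebook's demo framework; fp_interface and int8_interface are assumed wrappers around the original and quantized models, and output.save() follows the public OuteTTS examples rather than this PR's code:

```python
import gradio as gr

def make_demo(fp_interface, int8_interface):
    def synthesize(text, model_choice):
        # Route the request to the selected model.
        interface = int8_interface if model_choice.startswith("INT8") else fp_interface
        output = interface.generate(text=text)
        output.save("output.wav")
        return "output.wav"

    return gr.Interface(
        fn=synthesize,
        inputs=[
            gr.Textbox(label="Text to synthesize"),
            gr.Radio(
                ["FP (original)", "INT8 (quantized)"],
                value="INT8 (quantized)",
                label="Model",
            ),
        ],
        outputs=gr.Audio(label="Generated speech"),
    )
```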


@nikita-malininn (Contributor, Author)

Done.

@@ -22,6 +22,11 @@
"- [Run model inference](#Run-model-inference)\n",
@eaidova (Collaborator) commented Feb 12, 2025

Line #4: r = requests.get(

Please add a check that skip_kernel_extension.py already exists, so the notebook can be rerun without an internet connection.
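
A minimal sketch of such a check; the raw.githubusercontent.com URL is my assumption about where the helper lives in the openvino_notebooks repository:

```python
from pathlib import Path

import requests

helper_path = Path("skip_kernel_extension.py")

# Download the helper only when it is missing, so the notebook can be
# rerun without an internet connection.
if not helper_path.exists():
    r = requests.get(
        url="https://raw.githubusercontent.com/openvinotoolkit/openvino_notebooks/latest/utils/skip_kernel_extension.py"
    )
    helper_path.write_text(r.text)
```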


@nikita-malininn (Contributor, Author)

Done.

@eaidova (Collaborator) commented Feb 12, 2025

@nikita-malininn, could you please also fix the formatting of the updated notebook? You can find info on how to do that here.

P.S. Once the code formatting and the file downloading are fixed, I'm ready to merge your PR.

@eaidova eaidova merged commit 0b4b204 into openvinotoolkit:latest Feb 18, 2025
16 checks passed