llama-tts : precompute irFFT theta #12514

marcoStocchi · 2025-03-22T12:09:44Z

Added vector of precomputed theta values for inverse real FFT, added irfft_2() which uses the precomputed values.
Benchmark tests on AMD Ryzen 7 3700X: approximate performance gain of the irfft threads of 70%.
Left original irfft() untouched to ease further tests on different machines.

Some benchmark info:

AMD Ryzen 7 3700X 8-Core Processor
GCC 14.2.1 20240912 (Red Hat 14.2.1-3) for x86_64-redhat-linux
Model: outetts-0.2-0.5B-f16.gguf
Wavtk: wavtokenizer-large-75-f16.gguf
5 trials of llama-tts using the original irfft and the irfft2
Text prompt: "This is a test made using a standard configuration."

	irfft (ms)	irfft_2 (ms)
1	529.593	110.258
2	529.553	110.186
3	528.021	111.706
4	528.127	110.320
5	528.475	110.711
avg	528.754	110.636

* tts.cpp : added vector of precomputed theta values for inverse real FFT, added irfft_2() which uses the precomputed values. Benchmark tests on AMD Ryzen 7 3700X: approximate performance gain of the irfft threads of 70%. Left original irfft() untouched to ease further tests on different machines.

github-actions bot added the examples label Mar 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama-tts : precompute irFFT theta #12514

llama-tts : precompute irFFT theta #12514

marcoStocchi commented Mar 22, 2025

llama-tts : precompute irFFT theta #12514

Are you sure you want to change the base?

llama-tts : precompute irFFT theta #12514

Conversation

marcoStocchi commented Mar 22, 2025

Some benchmark info: