Skip to content

Latest commit

 

History

History
7 lines (5 loc) · 787 Bytes

README.md

File metadata and controls

7 lines (5 loc) · 787 Bytes

nanoFFT

An attempt to write a minimalistic FFT library using Sande-Tukey and COBRA algorithms in pure C. Only the AVX2-FMA single precision variant supports full vectorization. Partial vectorization is supported for AVX and FMA instruction sets in single and double precision.

The provided benchmarks compare performance with some of the popular FFT libraries for AMD Ryzen 4600h CPU with AVX2 and FMA sets enabled. Generally, the library tends to approximately match FFTW3f in measure mode for the scalar variant, however, its relative performance rapidly diminishes with increasing vector length.

Bench_1 Bench_2