Add small optimizations for QuantizeLinear #532

robertknight · 2025-01-11T08:38:18Z

Add a couple of small and easy optimizations to QuantizeLinear. This also benefits DynamicQuantizeLinear.

- Replace division by the scale with multiplication by its pre-computed reciprocal
- Remove unnecessary clamping, as Rust's `as` operator already saturates when casting from f32 to i8 / u8

This is ~15% faster in a test with ModernBERT. It should be possible to do much better with SIMD, but that will come later.
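
For illustration only, here is a minimal sketch of the two changes combined. It is not the code from this PR: the function name `quantize_linear_u8` and its signature are hypothetical, and it assumes a single per-tensor scale and round-half-to-even rounding. Note that multiplying by a reciprocal is not guaranteed to give bit-identical results to dividing for every input.

```rust
/// Hypothetical helper (not from the PR): quantize `input` to u8 as
/// `saturate(round(x / scale) + zero_point)`, with the division replaced
/// by a multiplication.
fn quantize_linear_u8(input: &[f32], scale: f32, zero_point: u8) -> Vec<u8> {
    // Compute the reciprocal once, outside the per-element loop.
    let inv_scale = 1.0 / scale;
    let zero_point = zero_point as f32;

    input
        .iter()
        .map(|&x| {
            // One multiplication per element instead of one division.
            let y = (x * inv_scale).round_ties_even() + zero_point;
            // No explicit clamp: a float-to-int `as` cast saturates at the
            // bounds of the target type (and maps NaN to 0).
            y as u8
        })
        .collect()
}
```

With `scale = 0.5` and `zero_point = 128`, an input of `1000.0` maps to `255` and `-1000.0` maps to `0` without any explicit clamping step.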

robertknight added 2 commits January 10, 2025 23:51

Replace division by multiplication-by-reciprocal in QuantizeLinear

ba52731

Optimize QuantizeLinear and DynamicQuantizeLinear by replacing divisions with multiplication by a pre-computed reciprocal.

Remove unnecessary clamping when quantizing

0aa771e

Rust's `as` operator already does the desired saturating cast when converting from f32 -> i8 / u8.
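
The cast behavior this commit relies on can be checked in isolation. The helpers below are hypothetical names written for this sketch, not functions from the codebase. They compare a plain `as` cast with an explicit clamp-then-cast for i8.

```rust
/// Plain cast: saturates at i8::MIN / i8::MAX, truncates toward zero,
/// and maps NaN to 0.
fn cast_i8(x: f32) -> i8 {
    x as i8
}

/// Explicit clamp followed by a cast, the redundant form.
fn clamp_then_cast_i8(x: f32) -> i8 {
    x.clamp(-128.0, 127.0) as i8
}

/// Same saturating behavior for an unsigned target type.
fn cast_u8(x: f32) -> u8 {
    x as u8
}

/// Returns true if both i8 conversions agree for every value in `values`.
fn casts_agree(values: &[f32]) -> bool {
    values.iter().all(|&x| cast_i8(x) == clamp_then_cast_i8(x))
}
```

Both i8 conversions return the same value for in-range, out-of-range, infinite, and NaN inputs, which is why the clamp can be dropped.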

robertknight merged commit e3eb30f into main Jan 11, 2025
2 checks passed

robertknight deleted the quantize-linear-reciprocal branch January 11, 2025 08:45
