feat: implement 128bit fft #3

sarah-quinones · 2023-02-14T14:07:25Z

No description provided.

IceTDrinker

I did not have time to read everything, do you have some paper resource on the f128 format/approach you take ?

Cargo.toml

src/lib.rs

src/fft128/mod.rs

src/fft128/f128_impl.rs

src/fft128/mod.rs

Cargo.toml

src/lib.rs

IceTDrinker · 2023-03-02T09:33:32Z

src/fft128/f128_impl.rs

+    }
+
+    #[inline(always)]
+    pub(crate) fn _mm256_quick_two_sum(simd: Avx, a: __m256d, b: __m256d) -> (__m256d, __m256d) {


as was discussed, move away from the intel intrisics notations

IceTDrinker · 2023-03-02T09:34:13Z

src/fft128/f128_impl.rs

+
+    impl Avx {
+        #[inline(always)]
+        pub fn _mm256_add_estimate_f128_f128(


as was discussed bench and compare the non estimate versions to be able to have the choice potentially for intrisics to select between estimate and more precise implementations

IceTDrinker · 2023-03-02T09:35:16Z

src/fft128/mod.rs

+}
+
+#[cfg(test)]
+mod tests {


as discussed make sure all permutations are well tested

also let's keep track the CI task to have tests for all the intrisics combinations (cc @soonum) and fix a toolchain version to avoid random lint breakage #6

sarah-quinones · 2023-03-17T19:19:08Z

closed as superseded by #11

sarah-quinones requested a review from IceTDrinker February 14, 2023 14:07

sarah-quinones force-pushed the fft128 branch from 89f2e8d to 4e94b00 Compare February 14, 2023 14:12

IceTDrinker reviewed Feb 14, 2023

View reviewed changes

Cargo.toml Outdated Show resolved Hide resolved

src/lib.rs Show resolved Hide resolved

src/fft128/mod.rs Show resolved Hide resolved

src/fft128/mod.rs Show resolved Hide resolved

src/fft128/f128_impl.rs Show resolved Hide resolved

src/fft128/mod.rs Outdated Show resolved Hide resolved

sarah-quinones force-pushed the fft128 branch from 4e94b00 to 5efab2c Compare February 15, 2023 09:52

IceTDrinker reviewed Feb 17, 2023

View reviewed changes

Cargo.toml Outdated Show resolved Hide resolved

sarah-quinones force-pushed the fft128 branch 2 times, most recently from a96ee67 to eccd8fa Compare February 17, 2023 12:34

tmontaigu reviewed Mar 1, 2023

View reviewed changes

src/lib.rs Outdated Show resolved Hide resolved

feat: implement 128bit fft

430c86f

sarah-quinones force-pushed the fft128 branch from eccd8fa to 430c86f Compare March 1, 2023 15:09

IceTDrinker reviewed Mar 2, 2023

View reviewed changes

sarah-quinones closed this Mar 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: implement 128bit fft #3

feat: implement 128bit fft #3

sarah-quinones commented Feb 14, 2023

IceTDrinker left a comment

IceTDrinker Mar 2, 2023

IceTDrinker Mar 2, 2023

IceTDrinker Mar 2, 2023

IceTDrinker Mar 2, 2023

sarah-quinones commented Mar 17, 2023

feat: implement 128bit fft #3

feat: implement 128bit fft #3

Conversation

sarah-quinones commented Feb 14, 2023

IceTDrinker left a comment

Choose a reason for hiding this comment

IceTDrinker Mar 2, 2023

Choose a reason for hiding this comment

IceTDrinker Mar 2, 2023

Choose a reason for hiding this comment

IceTDrinker Mar 2, 2023

Choose a reason for hiding this comment

IceTDrinker Mar 2, 2023

Choose a reason for hiding this comment

sarah-quinones commented Mar 17, 2023