-
Notifications
You must be signed in to change notification settings - Fork 210
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added more FMA functions with tests #89
Conversation
…8, _mm_fmadd_ps, _mm_addsub_ps, _mm_load_pd _mm_loadu_pd, _mm_store_pd, _mm_storeu_pd
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Rebase latest master branch
Header |
Can we use something like "ifdef USE_FMA"‚ and enable only for those who want it? |
eb8e6ef
to
c902b5e
Compare
Let's close this pull request in favor of #82. |
This pull request is the last split of the original #83.
I have added fmsub_ps, fmnadd_ps, fmnsub_ps, and the associated tests.
I had to lower the input vector in the test functions and use an epsilon of 0.0001f because those functions are less precise than just using mul and add, and they seem to diverge with big float numbers (I use qemu, it might be worth checking on real hardware).