Commit

Fix typo
kimwalisch committed Jun 27, 2024
1 parent 27de40c commit f9d625b
Showing 1 changed file with 5 additions and 5 deletions.
10 changes: 5 additions & 5 deletions README.md
@@ -90,11 +90,11 @@ ARM SVE is a new vector instruction set for ARM CPUs that was first released in
ARM SVE algorithms can be much faster than ARM NEON algorithms which are limited
to 128 bits vector length.

-libpopcnt new ARM SVE popcount algorithm is up to 3x faster than its ARM NEON
-popcount algorithm. Unfortunately runtime dispatching to ARM SVE is not yet well
-supported by the GCC and Clang compilers and libc's. Therefore, by default only
-the (portable) ARM NEON popcount algorithm is enabled when using libpopcnt on
-ARM CPUs.
+libpopcnt's new ARM SVE popcount algorithm is up to 3x faster than its ARM NEON
+popcount algorithm (on AWS Graviton3 CPUs). Unfortunately runtime dispatching to
+ARM SVE is not yet well supported by the GCC and Clang compilers and libc's.
+Therefore, by default only the (portable) ARM NEON popcount algorithm is enabled
+when using libpopcnt on ARM CPUs.

To enable libpopcnt's ARM SVE popcount algorithm you need to compile your program
using your compiler's ARM SVE option e.g.:
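
The hunk shown here ends before the README's own example command. As a rough illustration of why a compiler option is needed at all, the C sketch below reports which popcount path a translation unit would be built with. It assumes selection happens at compile time through the ACLE feature macros `__ARM_FEATURE_SVE` and `__ARM_NEON`; the function name `popcount_backend` is hypothetical and is not part of libpopcnt's API, and libpopcnt's actual selection logic may differ.

```c
#include <stdio.h>

/* Hypothetical helper, not part of libpopcnt: names the code path that
 * the compiler's target options make available at compile time. */
static const char *popcount_backend(void)
{
#if defined(__ARM_FEATURE_SVE)
    /* Defined by the compiler only when SVE is enabled for this build. */
    return "SVE";
#elif defined(__ARM_NEON)
    /* NEON is part of the baseline on AArch64 targets. */
    return "NEON";
#else
    /* Non-ARM target, or ARM without vector extensions. */
    return "portable";
#endif
}

/* Prints the selected path, e.g. from a program's main(). */
static void print_popcount_backend(void)
{
    printf("popcount backend: %s\n", popcount_backend());
}
```

On an AArch64 build without an SVE option this sketch would select the NEON branch. Enabling SVE through the compiler (for instance, GCC and Clang accept `-march=armv8-a+sve`) is what makes `__ARM_FEATURE_SVE` visible to the preprocessor, which matches the README's point that SVE is opt-in at compile time rather than detected at run time.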