# Accelerated Vector Search

In this project, we use the Intel AMX (Advanced Matrix Extensions) accelerator to speed up inner product and Euclidean distance calculations in vector search algorithms. The project is still very much a work in progress: we have a basic implementation so far, and we currently accelerate only brute-force searches. Please follow the instructions below to use the project and to contribute.

## Building from source

> **Note:** An Intel processor from the Sapphire Rapids lineup is required for this project.

1. Install the oneDNN library.

   ```sh
   git clone https://github.com/oneapi-src/oneDNN
   cd oneDNN/
   mkdir build/
   cd build/
   cmake -DCMAKE_INSTALL_PREFIX=/usr/local -DCMAKE_BUILD_TYPE=Release ..
   sudo make -j$(nproc) install
   ```

2. Build the project.

   ```sh
   ./build.sh
   ```

3. Run the example with verbose logging.

   ```sh
   export LD_LIBRARY_PATH=/usr/local/lib
   export ONEDNN_VERBOSE=1
   ./avs_example -d 16 -k 10 -b 2048 --nd 8192 --nq 4096
   ```

## References

  1. Exploring Intel AMX for Large Language Model Inference
  2. Highly Efficient Self-checking Matrix Multiplication on Tiled AMX Accelerators
  3. Fast Matrix Multiplication via Compiler-only Layered Data Reorganization and Intrinsic Lowering