asm_rs_bench

To run the benchmarks, you need Cargo installed and must be on an ARM platform. Then run either of the following commands:

cargo run --release --bin gemv # to run gemv benchmark
cargo run --release --bin gemm # to run gemm benchmark
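
Because the benchmarks only work on ARM, a benchmark binary could check the compilation target up front and fail with a clear message instead of a build or link error. The following is a minimal sketch of such a guard. The function names `is_arm_target` and `require_arm` are hypothetical and are not part of this repository.

```rust
/// Returns true when this code was compiled for a 32-bit or 64-bit ARM target.
/// `cfg!` is evaluated at compile time from the target triple.
pub fn is_arm_target() -> bool {
    cfg!(any(target_arch = "arm", target_arch = "aarch64"))
}

/// Hypothetical guard (not in the repository): returns an error that names
/// the actual architecture when the target is not ARM.
pub fn require_arm() -> Result<(), String> {
    if is_arm_target() {
        Ok(())
    } else {
        Err(format!(
            "these benchmarks need an ARM target, but this build targets {}",
            std::env::consts::ARCH
        ))
    }
}
```

Calling `require_arm()` at the top of a binary's `main` and exiting on `Err` would be one way to use it.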

The assembly code is taken from llama.cpp.

Note: I'm currently investigating a bug in the input to the quantized functions that causes the outputs to be off. Update 02/01/2025: No progress. The outputs from the optimized gemm and gemv functions are still off, and after much debugging I am no closer to understanding why. All my progress is in the cpp folder.
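
One common way to narrow down a mismatch like this is to compare the optimized output against a naive reference and report where the largest difference occurs. The sketch below assumes plain `f32` inputs in row-major layout, so it does not model the quantized data path. `gemv_reference` and `max_abs_diff` are hypothetical helper names, not functions from this repository or from llama.cpp.

```rust
/// Naive reference GEMV: computes y = A * x, where A is `rows` x `cols`
/// and stored row-major in `a`. Assumes unquantized f32 data.
pub fn gemv_reference(a: &[f32], x: &[f32], rows: usize, cols: usize) -> Vec<f32> {
    assert_eq!(a.len(), rows * cols, "matrix length must be rows * cols");
    assert_eq!(x.len(), cols, "vector length must equal cols");
    if cols == 0 {
        // An empty dot product is zero for every row.
        return vec![0.0; rows];
    }
    a.chunks_exact(cols)
        .map(|row| row.iter().zip(x).map(|(a, b)| a * b).sum::<f32>())
        .collect()
}

/// Returns the index and size of the largest absolute difference between
/// two equally long outputs, or None if both are empty.
/// A NaN difference is treated as infinitely large so it is never hidden.
pub fn max_abs_diff(expected: &[f32], actual: &[f32]) -> Option<(usize, f32)> {
    assert_eq!(expected.len(), actual.len(), "outputs must have equal length");
    let mut worst: Option<(usize, f32)> = None;
    for (i, (e, a)) in expected.iter().zip(actual).enumerate() {
        let raw = (e - a).abs();
        let d = if raw.is_nan() { f32::INFINITY } else { raw };
        if worst.map_or(true, |(_, w)| d > w) {
            worst = Some((i, d));
        }
    }
    worst
}
```

Feeding both functions the same small, hand-checkable input and looking at the index returned by `max_abs_diff` can show whether the error is confined to particular rows or spread across the whole output.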