Binary maxsim via hamming #191

michaelbridge · 2025-02-17T16:16:38Z

Binary quantization and hamming distance are critical for scaling multi-vector representations (i.e., Colbert).

It looks as though hamming for binary vectors has already been implemented.

While a hamming-based maxsim can be implemented over this with a postgres function per approach here, is this something that might be supported/optimized within the library?

Beyond this, is an unpack_bits operation to convert a binary vector into a float representation (to improve accuracy in a subsequent rerank step) something contemplated?

The text was updated successfully, but these errors were encountered:

michaelbridge · 2025-02-17T16:39:11Z

Ah, looks like RaBitQ is an implementation of binary quant, so perhaps this question is better restated as, how can Colbert/Colpali late interaction be optimized within this framework?

gaocegege · 2025-02-27T09:44:57Z

Hey, maybe you could checkout https://blog.vectorchord.ai/supercharge-vector-search-with-colbert-rerank-in-postgresql

michaelbridge · 2025-02-27T13:37:01Z

Thanks, but I linked that blog post above. Without a multi-vector index, that doesn't scale.

VoVAllen · 2025-02-27T14:19:12Z

We're baking some approach at #197 based on https://github.com/jlscheerer/xtr-warp/tree/main/warp/search. Please stay tuned!

michaelbridge changed the title ~~Binary quantization and hamming distance~~ Binary maxsim via hamming Feb 17, 2025

gaocegege added the type/question 🙋 Further information is requested label Feb 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Binary maxsim via hamming #191

Binary maxsim via hamming #191

michaelbridge commented Feb 17, 2025 •

edited

Loading

michaelbridge commented Feb 17, 2025

gaocegege commented Feb 27, 2025

michaelbridge commented Feb 27, 2025

VoVAllen commented Feb 27, 2025

Binary maxsim via hamming #191

Binary maxsim via hamming #191

Comments

michaelbridge commented Feb 17, 2025 • edited Loading

michaelbridge commented Feb 17, 2025

gaocegege commented Feb 27, 2025

michaelbridge commented Feb 27, 2025

VoVAllen commented Feb 27, 2025

michaelbridge commented Feb 17, 2025 •

edited

Loading