-
Notifications
You must be signed in to change notification settings - Fork 5.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Euclidean distance and Cosine similarity functions on dense vectors #23982
base: master
Are you sure you want to change the base?
Conversation
…tions for dense arrays
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! (docs)
Pull branch, local docs build, review - looks good. Thanks for the doc!
Thanks for the contribution! A few things to help your PR be ready to merge:
|
Thanks for the release note entry! A couple of suggestions to consider, to follow the Order of changes in the Release Note Guidelines.
|
Saved that user @snash4 is from IBM |
Description
Added two functions (Euclidean Distance and Cosine Similarity) on dense vectors.
Motivation and Context
feature requested here
#23981
Impact
Added two function for the feature:
Test Plan
Inserted a vector dataset (SIFT benchmark ) in an Iceberg table and performed top-k similarity search on both functions. The table has embedding_id and vector columns. Vector column is an Array type. The functions return the top-k similar vectors to the query vector.
Added the relevant test cases
Contributor checklist
Release Notes
Please follow release notes guidelines and fill in the release notes below.