Skip to content

Samples to use Kusto(Azure Data Explorer) as a Vector database for AI embeddings

License

Notifications You must be signed in to change notification settings

Anujh23/azure_kusto_vector

 
 

Repository files navigation

Kusto as a Vector database

Azure Data Explorer aka Kusto is a cloud-based data analytics service that enables users to perform advanced analytics on large datasets in real-time. It is particularly well-suited for handling large volumes of data, making it an excellent choice for storing and searching vectors.

Kusto supports a special data type called dynamic, which can store unstructured data such as arrays and properties bag. Dynamic data type is perfect for storing vector values. You can further augment the vector value by storing metadata related to the original object as separate columns in your table.
Kusto also supports in-built function series_cosine_similarity_fl to perform vector similarity searches.

Get started with Kusto for free.

Kusto_Vector

Getting started with Kusto and Open AI embedding

Demo Scenario

Wiki_embeddings

semantic_search_flow

If you’d like to try this demo, please follow the instructions in the Notebook.

It will allow you to -

  1. Use precomputed embeddings created by OpenAI API.

  2. Store the embeddings in Kusto.

  3. Convert raw text query to an embedding with OpenAI API.

  4. Use Kusto to perform cosine similarity search in the stored embeddings.

You can check the solution with AAD authentication and Kusto queries here.

About

Samples to use Kusto(Azure Data Explorer) as a Vector database for AI embeddings

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%