Demo: RAG with MariaDB Vector (dev version)

A Python script that implements Retrieval-Augmented Generation (RAG) on the MariaDB Knowledge Base using MariaDB's vector database capabilities, OpenAI's embedding model, and OpenAI's LLM.

This is my dev version with scraping, time recording, prints vector search results, etc. The simplified version is available at https://github.com/MariaDB/demos

Features

Scrapes MariaDB Knowledge Base articles
Converts HTML content to markdown format
Chunks articles into manageable segments
Creates embedding for each chunk using OpenAI's text-embedding-3-small model
Stores content and embeddings in MariaDB vector database
Takes a "question" as input
Searches for most relevant chunks using vector similarity
Generates responses using LLM (GPT-4) with relevant KB context

Prerequisites

Docker
Python 3.x (developed with 3.13)
OpenAI API key
MariaDB 11.7.1

Setup

Start MariaDB 11.7.1 with Docker:

docker run -p 127.0.0.1:3306:3306  --name mdb_1171 -e MARIADB_ROOT_PASSWORD=Password123! -d quay.io/mariadb-foundation/mariadb-devel:11.7

If needed, access MariaDB with:

docker exec -it mdb_1171 mariadb --user root -pPassword123!

If needed, install Docker with:

brew install docker

Add your OpenAI API key to your environment variables. Create a key at https://platform.openai.com/api-keys.

export OPENAI_API_KEY='your-key-here'

Set up a virtual environment

python -m venv venv
source venv/bin/activate  # On Unix/macOS

Install the required Python packages with pip

pip install -r requirements.txt

Usage

Running the following at the moment does everything from setting up the database, to printing a LLM response to a question.

python mariadb_kb_rag.py

Possible next todos

Separate preparation and question asking for easier running
Test with larger amount of KB data
Try with other embedding and LLM models
Graphical GUI somehow
Separate preparation and running questions to own python files
Test other embedding and LLM models
Add a graphical GUI with flask, vue, streamlit, gradio, etc.
Improve chunking solution
Try out more advanced chunking solutions, like https://github.com/bhavnicksm/chonkie/

Resources used to put this together

Cursor https://www.cursor.com/ with Claude Sonnet 3.5
MariaDB Vector https://mariadb.com/kb/en/vector-overview/
Installing and Using MariaDB via Docker https://mariadb.com/kb/en/installing-and-using-mariadb-via-docker/
OpenAI Embeddings https://platform.openai.com/docs/guides/embeddings
OpenAI Chat Completions https://platform.openai.com/docs/guides/text-generation

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
example.txt		example.txt
mariadb_kb_rag.py		mariadb_kb_rag.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Demo: RAG with MariaDB Vector (dev version)

Features

Prerequisites

Setup

Usage

Possible next todos

Resources used to put this together

About

Releases

Packages

Contributors 2

Languages

robertsilen/mariadb-kb-rag

Folders and files

Latest commit

History

Repository files navigation

Demo: RAG with MariaDB Vector (dev version)

Features

Prerequisites

Setup

Usage

Possible next todos

Resources used to put this together

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages