Setup and Run

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

CMAKE_ARGS="-DLLAMA_CUDA=on" pip install llama-cpp-python --no-cache-dir --force-reinstall --upgrade  # install llamacpp with cuda

Mixedbread Embedding Model - HuggingFace. Save this model in ai/embedding/models/

Llama3 8B Instruct Language Model - NousResearch/Meta-Llama-3-8B-Instruct. Save this model in ai/llm/models

Make sure you have necessary models specified in config.json in the respective folders and environment variables set in .env file. Then,

python3 app.py