💬 RAG based knowledge chatbot ✨

This Retrieval-Augmented Generation (RAG) application utilizes Azure Document Intelligence for multi-format document processing, OpenAI for LLM and Embedding, Milvus Lite for vector database for storing and retrieving document embeddings, and Streamlit for the user interface.

Libraries to Install

pip install streamlit milvus langchain openai python-dotenv

Project Setup

1. Clone the Repository

git clone https://github.com/tanyaaton/RAG-based-knowledge-chatbot
cd RAG-based-knowledge-chatbot

2. Install Required Libraries

Install all required libraries using requirements.txt file:

pip install -r requirements.txt

additionally also install the following libraries

pip install -U pymilvus
pip install -U langchain-community

This will ensure all necessary packages are available for running the RAG application, including document processing, vector storage, and LLM integration.

2. Set Up Environment Variables

Create a .env file in the root directory to store sensitive information such as API keys and endpoints:

OpenAI sign up here
Azure Document Intelligence sign up here


# OpenAI API Key
OPENAI_API_KEY=<your-openai-api-key>

# Azure Document Intelligence
AZURE_DOC_ENDPOINT=<your-azure-doc-intelligence-endpoint>
AZURE_DOC_KEY=<your-azure-doc-intelligence-key>

3. Configure Milvus Lite

Milvus Lite can be installed and run locally to manage vector embeddings for the document chunks. Follow the Milvus Lite installation guide for detailed instructions.

After installation, configure the Milvus connection in the connection.py file to match your setup.

4. Running the Application

First run the following command to activate Milvus Lite:

milvus-server --proxy-port 19530

Open new terminal. Tostart the Streamlit application, run:

streamlit run app.py

The app interface will load in your default web browser.

Key Functionalities

Document Upload: Users can upload a new document or select previously uploaded documents from the session.
Document Processing: Documents are processed using Azure Document Intelligence, and text is split into structured chunks.
Embedding and Storage: The content chunks are embedded using OpenAI’s embedding model and stored in Milvus.
Question Answering: Users can ask questions, and the app will retrieve relevant document chunks to generate an answer.

Additional Notes

Ensure your API keys and endpoints are active and have sufficient usage limits to handle document processing and question-answering tasks.

For any troubleshooting or additional setup information, consult the documentation provided for Streamlit, Milvus, Azure, and OpenAI APIs.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
README.md		README.md
app.py		app.py
connection.py		connection.py
document.py		document.py
env_template.txt		env_template.txt
function.py		function.py
prompt.py		prompt.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

💬 RAG based knowledge chatbot ✨

Libraries to Install

Project Setup

1. Clone the Repository

2. Install Required Libraries

2. Set Up Environment Variables

3. Configure Milvus Lite

4. Running the Application

Key Functionalities

Additional Notes

About

Releases

Packages

Languages

tanyaaton/RAG-based-knowledge-chatbot

Folders and files

Latest commit

History

Repository files navigation

💬 RAG based knowledge chatbot ✨

Libraries to Install

Project Setup

1. Clone the Repository

2. Install Required Libraries

2. Set Up Environment Variables

3. Configure Milvus Lite

4. Running the Application

Key Functionalities

Additional Notes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages