Privatized chatbots based on RAG and Llama3.
- RAG (Retrieval-Augmented Generation) optimizes the output of a large language model by letting it consult an authoritative knowledge base outside its training data before generating a response. Large language models (LLMs) are trained on massive amounts of data and use billions of parameters to generate output for tasks such as answering questions, translating language, and completing sentences. RAG extends these already powerful capabilities to domain-specific or organization-internal knowledge bases, all without retraining the model. It is a cost-effective way to keep LLM output relevant, accurate, and useful in a variety of contexts.
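The retrieve-then-generate loop described above can be sketched in a few lines. The keyword-overlap scoring and in-memory document list here are toy stand-ins for the embedding search and vector store used in this project:

```python
# Minimal sketch of the RAG pattern: retrieve relevant documents,
# then ground the prompt in them before calling the LLM.

def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    """Rank documents by naive word overlap with the query (a real
    system would use vector embeddings and a vector store)."""
    q_words = set(query.lower().split())
    scored = sorted(
        documents,
        key=lambda d: len(q_words & set(d.lower().split())),
        reverse=True,
    )
    return scored[:k]

def build_prompt(query: str, context: list[str]) -> str:
    """Stuff the retrieved context into the prompt so the model
    answers from it rather than from its training data alone."""
    ctx = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{ctx}\n\nQuestion: {query}"
```

The LLM then sees the retrieved passages inline, which is why no retraining is needed when the knowledge base changes.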
- Ollama: a framework for getting large models up and running quickly.
- Llama3 8B: Meta's open-source model.
- LangChain: helps developers easily build applications based on large language models (LLMs).
- Install Python dependencies:
pip install dspy gradio langchain langchain_community langchain_core langchain_huggingface pypdf fastembed chromadb sentence-transformers pandas openpyxl
- Ollama
see: https://github.com/ollama/ollama
# 1. install Ollama: https://github.com/ollama/ollama
# 2. Run ollama
ollama serve
# 3. Download llama3 8B
ollama pull llama3
# 4. Run llama3
ollama run llama3
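The steps above stand up a local server on Ollama's default port, 11434. A small sketch of querying it from Python over the REST API (the endpoint and payload shape follow Ollama's documented `/api/generate` interface; nothing here is specific to this project):

```python
import json
import urllib.request

def build_payload(prompt: str, model: str = "llama3") -> bytes:
    # non-streaming generate request, per Ollama's /api/generate schema
    return json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()

def ask(prompt: str, host: str = "http://localhost:11434") -> str:
    req = urllib.request.Request(
        f"{host}/api/generate",
        data=build_payload(prompt),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

# With the server running:
# print(ask("Reply with one word: hello"))
```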
Supported data types:
- json (training_jsons)
- pdf (training_pdfs)
- xlsx (training_xlsx)
- tweets @see https://github.com/chainupcloud/twitter-scan
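A loader for these formats plausibly dispatches on file extension, along these lines (the function name and the pypdf usage are illustrative, not this repo's actual code; xlsx files are converted to json first, as the next step shows):

```python
import json
from pathlib import Path

def load_text(path: str) -> str:
    """Return the raw text of one training file, chosen by extension."""
    suffix = Path(path).suffix.lower()
    if suffix == ".json":
        # parse and re-dump to normalize the JSON text
        return json.dumps(json.loads(Path(path).read_text(encoding="utf-8")))
    if suffix == ".pdf":
        from pypdf import PdfReader  # imported lazily: only PDFs need it
        return "\n".join(page.extract_text() or "" for page in PdfReader(path).pages)
    raise ValueError(f"unsupported file type: {suffix}")
```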
# Convert xlsx file to json
python training_xlsx_to_json.py
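One plausible shape for this conversion (the real training_xlsx_to_json.py may differ): read the sheet with pandas, then write one JSON record per row.

```python
import json
import pandas as pd

def xlsx_to_json(xlsx_path: str, json_path: str) -> None:
    df = pd.read_excel(xlsx_path)  # openpyxl handles the .xlsx parsing
    records = df.to_dict(orient="records")  # one dict per spreadsheet row
    with open(json_path, "w", encoding="utf-8") as f:
        json.dump(records, f, ensure_ascii=False, indent=2)
```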
# Create a local ChromaDB (stored in db/ in the project root)
python create_chroma_collection.py
# Load training data to ChromaDB (new files can be added at any time)
python load_data.py
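Loading typically means splitting each file's text into overlapping chunks and upserting them into the collection. Below is a toy fixed-window chunker plus the shape of the Chroma insert; the collection name, chunk sizes, and metadata keys are assumptions, and the real script likely embeds with fastembed or sentence-transformers from the pip list rather than Chroma's default model:

```python
def chunk(text: str, size: int = 500, overlap: int = 50) -> list[str]:
    """Overlapping fixed-size windows, so text spanning a chunk
    boundary still appears whole in at least one chunk."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

def load(text: str, source: str, db_path: str = "db") -> None:
    import chromadb  # deferred so chunk() is usable on its own
    client = chromadb.PersistentClient(path=db_path)
    collection = client.get_or_create_collection(name="training_data")
    pieces = chunk(text)
    collection.add(  # Chroma embeds the documents with its default model
        ids=[f"{source}-{i}" for i in range(len(pieces))],
        documents=pieces,
        metadatas=[{"source": source}] * len(pieces),
    )
```

Stable ids derived from the source file make re-running the loader on new files safe.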
# Start the chatbot
python chatbot.py
Once it is running, open http://localhost:7860 in a browser to use the built-in web UI.