feat: full-stack RAG pipeline for PDF ingestion and contextual chat capabilities #28

Merged
merged 19 commits into main
Nov 18, 2024

Conversation

@Nyumat Nyumat commented Nov 9, 2024

This pull request implements a full-stack retrieval-augmented generation (RAG) ingestion pipeline and querying capability. As the core functionality of Beavs AI, it will remain a constant work in progress, but we can finally say: we did it 🚀

Demo

beavsai_demo.mov

How It Works

PDF Upload Workflow

  1. File Upload: User uploads a PDF via the UI.
  2. Presigned URL: Backend generates a Cloudflare R2 presigned URL for secure access.
  3. File Parsing: The PDF is parsed into text documents with WebPDFLoader (steps 3–6 are sketched in code after this list).
  4. Chunking: Text is split into chunks with RecursiveCharacterTextSplitter.
  5. Embedding Creation: Chunks are embedded with OpenAIEmbeddings.
  6. Pinecone Storage: Embeddings are upserted into Pinecone.
  7. Database Update: documentIds and isIndexed fields in the course_materials table are updated.
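
As a rough illustration of steps 3–6 (and the handoff to step 7), here is a minimal sketch assuming the standard LangChain JS packages (@langchain/community, @langchain/textsplitters, @langchain/openai, @langchain/pinecone). The helper name ingestPdf, the environment variable names, the chunking parameters, and the per-file namespace are assumptions for illustration, not code from this PR:

```ts
// Illustrative ingestion sketch; ingestPdf, the env var names, and the
// namespace convention are assumptions, not the repo's actual code.
import { WebPDFLoader } from "@langchain/community/document_loaders/web/pdf";
import { RecursiveCharacterTextSplitter } from "@langchain/textsplitters";
import { OpenAIEmbeddings } from "@langchain/openai";
import { PineconeStore } from "@langchain/pinecone";
import { Pinecone } from "@pinecone-database/pinecone";

const pinecone = new Pinecone({ apiKey: process.env.PINECONE_API_KEY! });

export async function ingestPdf(presignedUrl: string, fileName: string) {
  // Fetch the PDF through the Cloudflare R2 presigned URL and parse it.
  const blob = await (await fetch(presignedUrl)).blob();
  const docs = await new WebPDFLoader(blob).load();

  // Split the parsed text into overlapping chunks.
  const splitter = new RecursiveCharacterTextSplitter({
    chunkSize: 1000, // assumed values, tune as needed
    chunkOverlap: 200,
  });
  const chunks = await splitter.splitDocuments(docs);

  // Embed the chunks and upsert them into Pinecone; addDocuments returns the
  // vector ids, which the caller can persist as documentIds and then flip
  // isIndexed to true in course_materials.
  const embeddings = new OpenAIEmbeddings({ model: "text-embedding-3-small" });
  const store = await PineconeStore.fromExistingIndex(embeddings, {
    pineconeIndex: pinecone.Index(process.env.PINECONE_INDEX!),
    namespace: fileName, // assumption: one namespace per uploaded file
  });
  return await store.addDocuments(chunks);
}
```

Returning the ids from addDocuments keeps the Pinecone upsert and the documentIds/isIndexed update in one place for the caller.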

Chat Context Workflow

  1. File Selection: User selects a previously uploaded PDF when starting a chat.
  2. Message Retrieval: Latest user message is retrieved for querying.
  3. Embedding Search: The message is embedded and used to search Pinecone for relevant chunks.
  4. Context Preparation: Retrieved chunks are combined into a file context.
  5. Chat Session: The chat session starts, using the PDF metadata (fileName). (Steps 2–4 are sketched in code after this list.)
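
A minimal sketch of steps 2–4 under the same assumptions; getFileContext, the top-k of 4, and the per-file namespace are hypothetical names and values, not the repo's actual code:

```ts
// Illustrative retrieval sketch; getFileContext and the namespace lookup are
// assumptions, not the repo's actual code.
import { OpenAIEmbeddings } from "@langchain/openai";
import { PineconeStore } from "@langchain/pinecone";
import { Pinecone } from "@pinecone-database/pinecone";

const pinecone = new Pinecone({ apiKey: process.env.PINECONE_API_KEY! });

export async function getFileContext(latestUserMessage: string, fileName: string) {
  const embeddings = new OpenAIEmbeddings({ model: "text-embedding-3-small" });
  const store = await PineconeStore.fromExistingIndex(embeddings, {
    pineconeIndex: pinecone.Index(process.env.PINECONE_INDEX!),
    namespace: fileName, // assumption: chunks were stored under the file's namespace
  });

  // Embed the latest user message and pull the k most relevant chunks.
  const matches = await store.similaritySearch(latestUserMessage, 4);

  // Combine the retrieved chunks into one context block for the chat prompt.
  return matches.map((doc) => doc.pageContent).join("\n\n");
}
```

The returned string can then be injected into the system prompt when the chat session starts with the selected fileName.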

Key Changes

  • Dependencies: Added @langchain/pinecone and removed langchain.
  • Schema: Added documentIds and isIndexed fields to course_materials (see the sketch after this list).
  • API Updates: Enhanced embeddings and chat routes for PDF processing and context retrieval.
  • Components: Updated chat components and actions for file-based context support.
  • Utilities: Added PDF parsing utilities and improved Pinecone and OpenAI clients.
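
For the Schema bullet, here is a minimal sketch of the two new columns, assuming a Drizzle ORM Postgres schema (the ORM is not confirmed by this PR description); the column names and types are illustrative:

```ts
// Illustrative schema sketch (Drizzle ORM assumed); only the two new columns
// from this PR are shown, and their exact names/types may differ in the repo.
import { pgTable, text, boolean } from "drizzle-orm/pg-core";

export const courseMaterials = pgTable("course_materials", {
  // ...existing columns omitted...

  // Pinecone vector ids returned when the file's chunks are upserted.
  documentIds: text("document_ids").array(),

  // Flipped to true once embedding and upsert complete for the file.
  isIndexed: boolean("is_indexed").default(false).notNull(),
});
```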

Next Steps

  • Improve error handling and performance.
  • Support multi-tenant use, scoping retrieval to each user's relevant info.
  • Surface sources for the AI response.
  • Allow instant chat to query across all documents.

Note

This foundational RAG pipeline allows Beavs AI to provide document-grounded, contextual chats, enhancing user interactions and response quality.

@Nyumat Nyumat added the critical Breaking change/feature label Nov 9, 2024
@Nyumat Nyumat self-assigned this Nov 9, 2024
@Nyumat Nyumat linked an issue Nov 9, 2024 that may be closed by this pull request

Nyumat commented Nov 10, 2024

Going to clean up the code a bit to help prepare for our next meeting, where we'll do a deep dive of this implementation!

@owenkrause owenkrause merged commit 84fbe2b into main Nov 18, 2024
1 check passed
Successfully merging this pull request may close these issues.

Research and set up vector database system