This is the project repository for the course DSAN5800: Advanced NLP at Georgetown University, 2024 Fall.
- Yuting Fan ([email protected])
- Peng Li ([email protected])
- Yiwei Qi ([email protected])
In this project, you will build a retrieval-augmented generation (RAG) system that can retrieve relevant documents from a knowledge base and use a language model to generate responses based on the retrieved information.
- Implement a document retriever using dense retrieval (e.g., FAISS).
- Combine the retriever with a language model (e.g., GPT) to generate responses.
- Build a simple web application that satisfies these functional requirements:
- Chat interface
- Document upload
- Document source attribution
- Evaluate retrieval and response generation quality.
- HuggingFace RAG Implementation
- Google Drive Workspace
- LangChain: Build a Retrieval Augmented Generation (RAG) App
- RAG Evaluation
- A well-documented GitHub repository containing your code
- A project report (4-6 pages) outlining your methodology, experiments, and results
- A 15 minute final presentation / demo showcasing your project, followed by a 5 minute Q&A session