Skip to content
This repository was archived by the owner on Feb 15, 2025. It is now read-only.

EPIC: RAG Part 2 #271

Open
1 of 3 tasks
gphorvath opened this issue Mar 20, 2024 · 0 comments
Open
1 of 3 tasks

EPIC: RAG Part 2 #271

gphorvath opened this issue Mar 20, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@gphorvath
Copy link

gphorvath commented Mar 20, 2024

Problem Statement:

One of the limitations of Large Language Models (LLMs) is that they are only able to respond to scenarios contained within training data - and training on new data is expensive given the size of the model. Retrieval Augmented Generation (RAG) is a technique to supplement the LLM with new data to enable it to provide more up-to-date responses.

Acceptance Criteria:

  • IDAM for managing access to RAG Data
  • API is compliant with OpenAI endpoints (Chat, Embeddings, Files, Assistants at a minimum).
  • Handle GPT2 (bottleneck) in RAG Backend
  • Better embeddings model (currently Instructor-XL)
  • Smarter / Better RAG

Definition of Done:

Tasks

Preview Give feedback
  1. documentation
  2. documentation
  3. documentation
@gphorvath gphorvath added the enhancement New feature or request label Mar 20, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant