Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Epic: docsite RAG #155

Open
josephjclark opened this issue Jan 24, 2025 · 0 comments
Open

Epic: docsite RAG #155

josephjclark opened this issue Jan 24, 2025 · 0 comments

Comments

@josephjclark
Copy link
Collaborator

This is an epic issue to track requirements for a good, trustworthy RAG system which we can plug into the AI assistant

We have a great prototype - it just needs bringing into line with our latest ideas about the server, testing, and integrating with job chat

  • Port the existing generate_docs_embeddings.py to use the new embeddings framework
    • Use pinecone
    • keep openai for embeddings I guess
  • Port the existing search.py function into a docs_search function, which uses a store in embeddings/
  • analyse and optmise the chunking of the docsite and adaptor APIs, so that the documents we embed are reasonably useful
  • setup github automation to automatically rebuild the embeddings database when the docsite is updated
  • Integrate with job chat and re-work the old job chat prompt
  • Include metadata for each document returned, in such a way that we can present them to the user
  • work out what tests we can add around this to prevent regressions and validate behaviour
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant