-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature: Add support for knowledge graphs #129
Comments
hey, long time! Nice to see knowledge graphs make a comeback. I do agree with the use cases and benefits over the current system.
The vector db based RAG system was supposed to do this but alas. Let's see how graph-rag performs.
That would be sweet, like comparing the documentation with the support tickets and trusting the documentation on conflict.
This is what is most concerning about this method. Embedding generation is super fast but in graph-rag, we need to run prompts on every chunk and some more which wouldn't just be prompt processing but output generation too so it would be much slower. Add the file changes/creations/deletions on top of that for graph update, although this shouldn't be that bad ig.
It uses the text-to-text task processing api in the server as default, which accepts a text input and generates a response. In addition to that we still support llama and ctransformers locally (in container) with a config change.
The issue mentions that document additions are possible now, no deletions it seems. Someone also mentioned https://github.com/circlemind-ai/fast-graphrag which is faster and more accurate to microsoft's implementation according to their benchmark (https://github.com/circlemind-ai/fast-graphrag/blob/main/benchmarks/README.md) so that should not be an issue if we use this.
🚀 let's find a common time when we can meet. When are you usually free? And allow me some time to find more interested people. |
The traditional RAG approach has difficulty in extracting complex relationships or overarching themes from the source material due to chunking and later only retrieving some of these chunks. This limits the usefulness of RAG for more complex and in-depth topics that cannot be solved by retrieving only chunks of the source material. Also, real-world data sources may contain conflicting and unreliable information which may confuse an LLM trying to generate an answer without being aware of the broader context.
Knowledge graphs can help solve this issue by incrementally building a graph structure from the source data where each edge in the graph represents a contextual relationships between separate sets of facts or topics. This methodology allows not only retrieving the relevant chunk of source material but also the relevant context for that source material.
Potential use cases
Difficulties/Limitations
P.S. I'm working 50% currently so I'm available for discussions or just catching up!
EDIT: A good basic explanation of the concepts involved: https://www.youtube.com/watch?v=6vG_amAshTk
The text was updated successfully, but these errors were encountered: