Apply NEF and FFF for RAG #52

MrCsabaToth · 2024-09-17T05:15:52Z

Named Entity Filtering (https://blog.cubed.run/eliminating-hallucinations-lesson-1-named-entity-filtering-nef-5f5956d748e0) and Fully Formatted Facts (https://medium.com/@JamesStakelum/the-end-of-ai-hallucinations-a-breakthrough-in-accuracy-for-data-engineers-e67be5cc742a), along with Noun-Phrase Dominance, are supposed to greatly decrease hallucinations and increase RAG performance (in terms of precision and correctness).

I've seen earlier BERT-based models, and a few others, that can generate tags for chunks automatically. These tags should be stored as metadata alongside the chunks in the RAG store and used for filtering at retrieval time, to avoid the problems described in the articles above.
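A minimal sketch of what tag-based filtering could look like, assuming the tags are already generated and stored with each chunk. The `Chunk` class, the `filter_by_entities` helper, and the sample data are hypothetical and only illustrate the idea. This is one plausible reading of entity filtering, not the method from the linked articles.

```python
from dataclasses import dataclass, field


@dataclass
class Chunk:
    """A retrieved text chunk with its auto-generated tags (hypothetical shape)."""
    text: str
    tags: set = field(default_factory=set)  # tags stored as chunk metadata


def filter_by_entities(chunks, query_entities):
    """Keep only chunks whose tags mention at least one entity from the query."""
    wanted = {e.lower() for e in query_entities}
    return [c for c in chunks if wanted & {t.lower() for t in c.tags}]


# Two chunks about similarly named entities; only one matches the query entity.
chunks = [
    Chunk("Alfonso II was king of Asturias.", {"Alfonso II"}),
    Chunk("Afonso II was king of Portugal.", {"Afonso II"}),
]
filtered = filter_by_entities(chunks, ["Afonso II"])
```

Filtering on exact tag matches before (or after) the vector search is meant to keep near-duplicate names from bleeding into the context; a real setup would likely push this filter into the vector store's own metadata query.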

We should also tune the resolver prompt that we use when preparing conversation pieces for embedding, so that besides resolving / unfolding contextual references it also instructs the model to "avoid any ambiguity".
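As a rough illustration, the extra instruction could be appended to the resolver prompt like this. The prompt wording, `RESOLVER_INSTRUCTIONS`, and `build_resolver_prompt` are assumptions made for this sketch and are not the project's actual prompt.

```python
# Hypothetical resolver instructions; the real prompt in the project may differ.
RESOLVER_INSTRUCTIONS = (
    "Rewrite the last message so that it can be understood on its own. "
    "Resolve and unfold all contextual references (pronouns, 'that', 'there', etc.) "
    "using the conversation history. "
    "Avoid any ambiguity: name every person, place, and thing explicitly."
)


def build_resolver_prompt(history, message):
    """Assemble the resolver prompt from prior turns and the message to rewrite."""
    history_text = "\n".join(history)
    return (
        f"{RESOLVER_INSTRUCTIONS}\n\n"
        f"Conversation history:\n{history_text}\n\n"
        f"Last message:\n{message}"
    )


prompt = build_resolver_prompt(
    ["User: Tell me about Afonso II of Portugal."],
    "User: When did he die?",
)
```

The rewritten, self-contained message is what would then be embedded, so the stored vector reflects explicit entity names rather than pronouns.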


MrCsabaToth added the enhancement (New feature or request) and RAG (Retrieval Augmented Generation related) labels on Sep 17, 2024

MrCsabaToth added a commit that referenced this issue Sep 30, 2024

As a precursor for FFF and NEF add ambiguity removal to the resolver …

6b71dc4

…prompt #52
