Add ADR for what repository will contain RAG #163

jwm4 · 2024-12-06T23:50:22Z

No description provided.

franciscojavierarceo · 2024-12-07T01:28:17Z

docs/retrieval-augmented-generation/rag-repo.md

+
+- There will be a new repository for RAG.
+- It will be located at https://github.com/instructlab/retrieval-augmented-generation
+- By mid-January, it will be available and working but not integrated with InstructLab.


is there some rationale behind these two dates? just to hold ourselves accountable?

The dates are driven mainly by availability of people to do the work. The mid-March elements of the plan will require a lot of coordination with the maintainers of the core instructlab/instructlab repo. Those people are mostly already committed to other activities through mid-January.

docs/retrieval-augmented-generation/rag-repo.md

khaledsulayman · 2024-12-10T21:13:24Z

docs/retrieval-augmented-generation/rag-repo.md

+- By mid-March, it will be integrated with InstructLab with the new repository being invoked by the core repository and maybe also by the SDG repository.
+- Eventually, it will be integrated with InstructLab with the new repository being invoked only by the core repository.


I'm not sure I agree with necessarily assigning dates to these efforts from now. I think it does make sense to split up these efforts into phases, and then we can internally scope out work for the desired releases.

I think the dates are important, but I am fine with dropping them from this document and using other venues to get aligned around these dates. I will do that.

khaledsulayman · 2024-12-10T21:35:11Z

docs/retrieval-augmented-generation/rag-repo.md

+  - Con: Many things we will want to do to add advanced functionality to make RAG more effective will require changes to both indexing and run-time RAG.  If those components are split across multiple repositories, that will make delivering such changes more complicated.
+- Put the indexing and run-time RAG code in <https://github.com/instructlab/instructlab> (core)
+  - Pro: This has the advantage of not adding any new dependencies.
+  - Pro: However, since the existing document processing is in SDG, the flow for indexing for RAG would be a bit complicated (i.e., it starts with a CLI call handled by the core repo then goes to SDG for some of the document processing and then back to the core for vector database indexing).   That drawback will be eliminated if/when the document processing moves into the core repository.


not sure it's an issue at all that it's making a call to SDG since the instructlab/instructlab dependencies contain SDG anyway. We don't expect users to be using ilab without SDG

however, I do agree with the below cons and don't think it's best to store any of this in instructlab/instructlab if we want to advertise this as 'our RAG solution'

not sure it's an issue at all that it's making a call to SDG since the instructlab/instructlab dependencies contain SDG anyway. We don't expect users to be using ilab without SDG

The issue is not the dependencies, it is the flow -- if document processing starts in core and then moves to SDG and then moves back to core, that flow will be more difficult to understand and maintain than if it was all in one place. This should be labeled as "con" , not "pro". I will fix that and add some more text to clarify what my concern is.

Core does serve as a sort of orchestrator so I'm not worried about that, but making Core required for SDG to interact with RAG may not be preferred - a definite con

docs/retrieval-augmented-generation/rag-repo.md

nathan-weinberg · 2024-12-11T18:57:19Z

docs/retrieval-augmented-generation/rag-repo.md

+  - Con: Many things we will want to do to add advanced functionality to make RAG more effective will require changes to both indexing and run-time RAG.  If those components are split across multiple repositories, that will make delivering such changes more complicated.
+- Put the indexing and run-time RAG code in <https://github.com/instructlab/instructlab> (core)
+  - Pro: This has the advantage of not adding any new dependencies.
+  - Pro: However, since the existing document processing is in SDG, the flow for indexing for RAG would be a bit complicated (i.e., it starts with a CLI call handled by the core repo then goes to SDG for some of the document processing and then back to the core for vector database indexing).   That drawback will be eliminated if/when the document processing moves into the core repository.


Core does serve as a sort of orchestrator so I'm not worried about that, but making Core required for SDG to interact with RAG may not be preferred - a definite con

docs/retrieval-augmented-generation/rag-repo.md

anastasds · 2024-12-11T19:05:51Z

I think I am settling on advocating on a prototyping phase working within a submodule in the instructlab repo to figure out things that are shared concerns across the product like config, prompt template management, model interactions, model management, and more, and explicitly intend to extract something independently deployable into its own repo later.

It can be very difficult to draw component boundaries from the outset - things like the ones I listed above will need to be figured out across both retrieval and inference;
if we start with a submodule in the core repo, I think it's highly likely that we can extract it into its own repo in a cleaner way later.

What I want to avoid is having two places where there is config management, two places where prompt templates are defined, two places where there is model management (since we'll have embedding models too), and so on.

A directory level CODEOWNERS in the instructlab repo should give us the autonomy needed during the design exploration phase of things.

nathan-weinberg

This looks good to me - @anastasds @jwm4 appreciate all the work y'all put into this! - I'd like @cdoern to approve as well but going ahead and giving this my sign-off

franciscojavierarceo reviewed Dec 7, 2024

View reviewed changes

anastasds reviewed Dec 10, 2024

View reviewed changes

docs/retrieval-augmented-generation/rag-repo.md Outdated Show resolved Hide resolved

nathan-weinberg reviewed Dec 10, 2024

View reviewed changes

docs/retrieval-augmented-generation/rag-repo.md Outdated Show resolved Hide resolved

khaledsulayman reviewed Dec 10, 2024

View reviewed changes

nathan-weinberg reviewed Dec 11, 2024

View reviewed changes

jwm4 changed the title ~~Add ADR for New repository for RAG~~ Add ADR for what repository will contain RAG Dec 13, 2024

anastasds approved these changes Dec 13, 2024

View reviewed changes

nathan-weinberg approved these changes Dec 13, 2024

View reviewed changes

nathan-weinberg requested a review from cdoern December 13, 2024 17:53

khaledsulayman approved these changes Dec 13, 2024

View reviewed changes

jwm4 closed this Dec 13, 2024

jwm4 force-pushed the jwm4-rag-repo branch from 37118af to 1102817 Compare December 13, 2024 19:14

jwm4 mentioned this pull request Dec 13, 2024

Second attempt to submit RAG repo location ADR because the last one failed #165

Merged

mairin mentioned this pull request Dec 17, 2024

InstructLab Maintainer nomination for Anastas Stoyanovsky instructlab/instructlab#2932

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ADR for what repository will contain RAG #163

Add ADR for what repository will contain RAG #163

jwm4 commented Dec 6, 2024

franciscojavierarceo Dec 7, 2024

jwm4 Dec 10, 2024

khaledsulayman Dec 10, 2024

jwm4 Dec 11, 2024

khaledsulayman Dec 10, 2024

khaledsulayman Dec 10, 2024

jwm4 Dec 11, 2024 •

edited

Loading

nathan-weinberg Dec 11, 2024

nathan-weinberg Dec 11, 2024

anastasds commented Dec 11, 2024 •

edited

Loading

nathan-weinberg left a comment

		- By mid-March, it will be integrated with InstructLab with the new repository being invoked by the core repository and maybe also by the SDG repository.
		- Eventually, it will be integrated with InstructLab with the new repository being invoked only by the core repository.

Add ADR for what repository will contain RAG #163

Add ADR for what repository will contain RAG #163

Conversation

jwm4 commented Dec 6, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jwm4 Dec 11, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anastasds commented Dec 11, 2024 • edited Loading

nathan-weinberg left a comment

Choose a reason for hiding this comment

jwm4 Dec 11, 2024 •

edited

Loading

anastasds commented Dec 11, 2024 •

edited

Loading