Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use RAG with images and documents? #7193

Open
2 tasks done
djaffer opened this issue Nov 12, 2024 · 3 comments
Open
2 tasks done

How to use RAG with images and documents? #7193

djaffer opened this issue Nov 12, 2024 · 3 comments
Labels
auto:question A specific question about the codebase, product, project, or how to use a feature

Comments

@djaffer
Copy link
Contributor

djaffer commented Nov 12, 2024

Checklist

  • I added a very descriptive title to this issue.
  • I included a link to the documentation page I am referring to (if applicable).

Issue with current documentation:

How to use RAG with images and documents both together?

Idea or request for content:

No response

@dosubot dosubot bot added the auto:question A specific question about the codebase, product, project, or how to use a feature label Nov 12, 2024
Copy link

dosubot bot commented Nov 12, 2024

Currently, there isn't specific documentation or examples in LangChainJS that demonstrate using Retrieval-Augmented Generation (RAG) with both images and documents together. However, LangChainJS does support handling multimodal data, including images and text, separately. You can integrate image data by using models that accept image inputs, such as by passing images as byte strings or URLs in content blocks. For text documents, you can follow the standard RAG process involving indexing, retrieval, and generation.

To combine these in a RAG pipeline, you would need to handle the image and text data separately and then integrate their outputs. This might involve retrieving relevant text documents and processing images independently, then using the results together in a language model to generate a response. Unfortunately, specific tools or methods for combining these in a single RAG workflow aren't detailed in the current LangChainJS documentation.

To continue talking to Dosu, mention @dosu.

@erashishgujalwar
Copy link

i tried keeping img tag in the context, but in the response, LLLM is skipping the img tag. multiple tuning is not helping. Not sure how t o proceed.

@djaffer
Copy link
Contributor Author

djaffer commented Dec 20, 2024

I saw langchain example for Python but not on js.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
auto:question A specific question about the codebase, product, project, or how to use a feature
Projects
None yet
Development

No branches or pull requests

2 participants