WIP - Switch the MMQnA LVM for Gaudi to use the llava microservice instead of tgi-gaudi #10

Closed
wants to merge 2 commits into from


Collaborator

@dmsuehir dmsuehir commented Nov 1, 2024

Description

After adding audio data prep/ingestion, we found that the LVM used for MMQnA on Gaudi did not work when the user query contained only text (no images). While investigating, I could not get reasonable results from text-only prompts sent to tgi-gaudi, even when calling it directly without the other OPEA services. Xeon instead uses the llava microservice from GenAIComps for the LVM, which runs the transformers image-to-text pipeline in the backend. This llava microservice also supports Gaudi. This PR switches the LVM microservice for MMQnA on Gaudi from tgi-gaudi to llava, in order to get better results with text-only queries. Note that I don't seem to be able to use LLaVA-NeXT models with the llava microservice, so the model has to be changed as well.
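
For reviewers who want to reproduce the text-only case, here is a minimal sketch of building a request body for an LVM endpoint with and without an image. The field names `prompt` and `image`, the empty-string convention for a missing image, and the helper name `build_lvm_request` are assumptions for illustration and are not taken from this PR; check the GenAIComps LVM README for the actual schema.

```python
import json
from typing import Optional


def build_lvm_request(prompt: str, image_b64: Optional[str] = None) -> str:
    """Serialize a query for a hypothetical LVM endpoint.

    Assumption: the service accepts a JSON object with a "prompt" string
    and an "image" string (base64), where an empty string means
    "no image", i.e. a text-only query.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    # Text-only queries send an empty image field rather than omitting it,
    # so both query types share one payload shape.
    payload = {"prompt": prompt, "image": image_b64 or ""}
    return json.dumps(payload)


# Text-only query (the case that gave poor results with tgi-gaudi).
text_only = build_lvm_request("What is the video about?")

# Query with an image; the base64 string here is a placeholder, not real data.
with_image = build_lvm_request("Describe this frame.", image_b64="aGVsbG8=")
```

Sending both payloads to the same service makes it easy to compare text-only and image+text behavior side by side.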

Issues

RFC: https://github.com/opea-project/docs/blob/main/community/rfcs/24-10-02-GenAIExamples-001-Image_and_Audio_Support_in_MultimodalQnA.md

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Describe the tests that you ran to verify your changes.

@dmsuehir dmsuehir closed this Nov 4, 2024