WIP - Switch the MMQnA LVM for Gaudi to use the llava microservice instead of tgi-gaudi #10
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
After adding audio data prep/ingestion, we found that the LVM used for MMQnA on Gaudi wasn't working when the user query only contained text (no images). After investigating this I found that I wasn't getting reasonable results with text-only prompts to tgi-gaudi (even outside of using other OPEA services). Xeon uses the llava microservice from GenAIComps for the LVM instead, which uses the transformers
image-to-text
pipeline in the backend. This llava microservice also supports Gaudi. In this PR I am switching the LVM microservice for MMQnA for Gaudi to use llava instead of tgi-gaudi, in order to be able to get better results with text only queries. Another thing to note is that it doesn't seem like I'm able to use LLaVa-NeXTmodels with microservice, so the model has to be changed as well.
Issues
RFC: https://github.com/opea-project/docs/blob/main/community/rfcs/24-10-02-GenAIExamples-001-Image_and_Audio_Support_in_MultimodalQnA.md
Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
List the newly introduced 3rd party dependency if exists.
Tests
Describe the tests that you ran to verify your changes.