New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

AgentQnA helm chart deploy update #837

Open

yongfengdu wants to merge 1 commit into opea-project:main from yongfengdu:agentqna

Collaborator

yongfengdu commented Feb 26, 2025

Sync latest changes with GenAIExamples.
Added cpu deployment with smaller model.
Updated README with detailed instructions.
Support using PVC for passing tools configuration. Fix minor issues.

Description

Update agentqna helm charts

Issues

Closed #827
Closed #798
Closed #783
Example 1524
Example 1523

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Manual tested deploy and helm test with gaudi-values.yaml and cpu-values.yaml.


          AgentQnA helm chart deploy update

7e54abf

Sync latest changes with GenAIExamples.
Added cpu deployment with smaller model.
Updated README with detailed instructions.
Support using PVC for passing tools configuration.
Fix minor issues.

Signed-off-by: Dolpher Du <[email protected]>

yongfengdu requested a review from lianhao as a code owner

February 26, 2025 07:30

yongfengdu requested review from poussa and mkbhanda

February 27, 2025 12:01

eero-t reviewed

View reviewed changes

Contributor

eero-t left a comment

Noticed few trivial things.

helm-charts/agentqna/README.md


		Note that this is an example to demonstrate how agent works and tested with prepared data and questions. Using different datasets, models and questions may get different results.

		Agent usually requires larger models to performance better, we used Llama-3.3-70B-Instruct for test, which requires 4x Gaudi devices for local deployment.

Contributor

eero-t Feb 27, 2025

Typo:

Suggested change

      
            Agent usually requires larger models to performance better, we used Llama-3.3-70B-Instruct for test, which requires 4x Gaudi devices for local deployment.
          
            Agent usually requires larger models to perform better, we used Llama-3.3-70B-Instruct for test, which requires 4x Gaudi devices for local deployment.

I guess it could be run (slowly) also on CPU with enough memory?

helm-charts/agentqna/README.md


		Agent usually requires larger models to performance better, we used Llama-3.3-70B-Instruct for test, which requires 4x Gaudi devices for local deployment.

		With helm chart, we also provided option with smaller model(Meta-Llama-3-8B-Instruct) with compromised performance on Xeon CPU only environment for you to try.

Contributor

eero-t Feb 27, 2025

Suggested change

      
            With helm chart, we also provided option with smaller model(Meta-Llama-3-8B-Instruct) with compromised performance on Xeon CPU only environment for you to try.
          
            With helm chart, we also provided option with smaller model (Meta-Llama-3-8B-Instruct) with compromised performance on Xeon CPU only environment for you to try.

helm-charts/agentqna/README.md

               ## Deploy
-              helm install agentqna oci://ghcr.io/opea-project/charts/agentqna --set global.HUGGINGFACEHUB_API_TOKEN=${HUGGINGFACEHUB_API_TOKEN} --set tgi.enabled=True
+              The Deployment includes preparing tools and sql data.

Contributor

eero-t Feb 27, 2025

Suggested change

      
            The Deployment includes preparing tools and sql data.
          
            The Deployment includes preparing tools and SQL data.

helm-charts/agentqna/README.md


		A volume is required to put tools configuration used by agent, and the database data used by sqlagent.

		We'll use hostPath in this readme, which is convenient for single worker node deployment. PVC is recommended in a bigger cluster. If you want to use a PVC, comment out the `toolHostPath` and replace with `toolPVC` in the values.yaml.

Contributor

eero-t Feb 27, 2025

Suggested change

      
            We'll use hostPath in this readme, which is convenient for single worker node deployment. PVC is recommended in a bigger cluster. If you want to use a PVC, comment out the `toolHostPath` and replace with `toolPVC` in the values.yaml.
          
            We'll use hostPath in this readme, which is convenient for single worker node deployment. PVC is recommended in a bigger cluster. If you want to use a PVC, comment out the `toolHostPath` and replace with `toolPVC` in the `values.yaml`.

helm-charts/agentqna/README.md


		We'll use hostPath in this readme, which is convenient for single worker node deployment. PVC is recommended in a bigger cluster. If you want to use a PVC, comment out the `toolHostPath` and replace with `toolPVC` in the values.yaml.

		Create the directory /mnt/tools in the worker node, which is the default in values.yaml. We use the same directory for all 3 agents for easy configuration.

Contributor

eero-t Feb 27, 2025

Suggested change

      
            Create the directory /mnt/tools in the worker node, which is the default in values.yaml. We use the same directory for all 3 agents for easy configuration.
          
            Create the directory `/mnt/tools` in the worker node, which is the default in `values.yaml`. We use the same directory for all 3 agents for easy configuration.

helm-charts/agentqna/variant-openai-values.yaml

+                OPENAI_API_KEY: EMPTY
+                model: "YourModel"
+                # Use OpenAI KEY
+                # llm_engine: openai

Contributor

eero-t Feb 27, 2025

Why these are commented out?

helm-charts/agentqna/variant-openai-values.yaml

Comment on lines +18 to +22

+                OPENAI_API_KEY: EMPTY
+                model: "YourModel"
+                # Use OpenAI KEY
+                # llm_engine: openai
+                # OPENAI_API_KEY: YourOpenAIKey

Contributor

eero-t Feb 27, 2025 •

edited

Loading

These extra key comments are redundant for all of these 3 subcharts.

helm-charts/common/agent/gaudi-tgi-values.yaml

		@@ -6,6 +6,24 @@

Contributor

eero-t Feb 27, 2025

Just in case:

Suggested change

      
            vllm:
          
              enabled: false

helm-charts/common/agent/gaudi-values.yaml

		@@ -6,6 +6,15 @@

Contributor

eero-t Feb 27, 2025

Just in case:

Suggested change

      
            tgi:
          
              enabled: false

helm-charts/common/agent/values.yaml

Comment on lines +30 to +31

		# Uncomment this if you have an tool configuration file
		tools: /home/user/comps/agent/src/tools/custom_tools.yaml

Contributor

eero-t Feb 27, 2025

It's already uncommented?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

eero-t eero-t left review comments

lianhao Awaiting requested review from lianhao lianhao is a code owner

poussa Awaiting requested review from poussa

mkbhanda Awaiting requested review from mkbhanda

At least 2 approving reviews are required to merge this pull request.

Labels

None yet