Mistral via Azure #1678

Open

pholz opened this issue Feb 5, 2025 · 5 comments

pholz commented Feb 5, 2025

Should it be possible to use a non-OpenAI model that is hosted on Azure? Specifically, I would like to use Mistral, which I have already deployed on Azure. But when I run indexing, I keep getting Operation 'chat' failed errors from the fnllm package.

If I look at the model URLs, OpenAI deployments take the shape <baseURL>/openai/deployments/<deploymentname>/chat/completions?api-version=2024-08-01-preview, whereas the Mistral deployment uses <baseURL>/models/chat/completions?api-version=2024-05-01-preview. Since I only set the base URL in the settings file, I assume the rest of the path is filled in somehow? Or is there a way to change this too?
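
For reference, the base URL is the only endpoint-related value I set; my llm block in settings.yaml looks roughly like this (placeholder values, and I'm not even sure openai_chat is the right type for a non-OpenAI deployment):

```yaml
llm:
  api_key: ${GRAPHRAG_API_KEY}
  type: openai_chat            # guessing; azure_openai_chat seems tied to the /openai/deployments/... URL shape
  model: mistral-large         # placeholder for the Mistral model I deployed
  api_base: https://<baseURL>  # the only URL-related field I set
```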


yueqianh commented Feb 7, 2025

Non-OpenAI models from Azure AI Foundry use the OpenAI chat completions template. Simply run them with the OpenAI Chat template in settings.yaml. I got DeepSeek-R1 from Azure AI Foundry working that way.
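
For example, my DeepSeek-R1 settings looked roughly like this (the host is a placeholder and the other keys are left at their defaults):

```yaml
llm:
  api_key: ${GRAPHRAG_API_KEY}
  type: openai_chat   # the OpenAI Chat template, not azure_openai_chat
  model: deepseek-r1
  api_base: https://<DeepSeek-R1-deploymentname>.eastus2.models.ai.azure.com/v1
```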


pholz commented Feb 7, 2025

I still can't get it to work: I changed the endpoints to match the "target URIs" of my LLM and embeddings deployments, but now I get a 401 Unauthorized error. Would you share a sample of your settings file?

@yueqianh

@pholz I've been testing other models in Azure AI Foundry. I realised that while DeepSeek-R1 works fine, other models do not, because their API endpoint format is different.

base_url in OpenAI Chat mode:

  • DeepSeek-R1: https://[DeepSeek-R1-deploymentname].eastus2.models.ai.azure.com/v1/
  • Phi-4 and other models: https://[deployment_name].services.ai.azure.com/models/

GraphRAG auto-completes the base_url into the following:

  • DeepSeek-R1: https://[DeepSeek-R1-deploymentname].eastus2.models.ai.azure.com/v1/chat/completions
    which is the correct endpoint for accessing DeepSeek-R1 models (no API version)
  • Phi-4 and other models: https://[deployment_name].services.ai.azure.com/models/chat/completions
    which lacks the API version query string required for these models. The full endpoint should be: https://[deployment_name].services.ai.azure.com/models/chat/completions?api-version=2024-05-01-preview

Seeking help from the GraphRAG team to add support for these Azure AI Model Inference models. Appreciate any workaround!
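
For anyone debugging this, a rough way to confirm the deployment itself is reachable (outside GraphRAG) is to call the chat completions endpoint directly with the api-version query string appended. The endpoint, key variable, and model name below are placeholders, and some deployments may expect an api-key header instead of a bearer token:

```python
import os
import requests

# Placeholder Azure AI Model Inference endpoint; note the api-version query
# string that GraphRAG's auto-completed URL currently omits.
endpoint = "https://<deployment_name>.services.ai.azure.com/models/chat/completions"
params = {"api-version": "2024-05-01-preview"}
headers = {
    # Or an "api-key" header, depending on the deployment.
    "Authorization": f"Bearer {os.environ['AZURE_AI_API_KEY']}",
    "Content-Type": "application/json",
}
body = {
    "model": "Phi-4",  # placeholder model name
    "messages": [{"role": "user", "content": "Say hello."}],
}

resp = requests.post(endpoint, params=params, headers=headers, json=body)
print(resp.status_code, resp.json())
```

If this returns 200 but GraphRAG still fails, the problem is the missing query string rather than the deployment or the key.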

@yueqianh

I managed to use LiteLLM to route access to Azure AI Services (Azure AI Studio / Azure AI Foundry). You can give it a try!
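
A minimal proxy config along these lines worked for me (the model name, host, and key variable are placeholders; azure_ai is LiteLLM's provider prefix for Azure AI endpoints):

```yaml
# litellm_config.yaml -- start the proxy with: litellm --config litellm_config.yaml
model_list:
  - model_name: phi-4                       # the name GraphRAG will request
    litellm_params:
      model: azure_ai/Phi-4                 # placeholder deployment
      api_base: https://<deployment_name>.services.ai.azure.com/models
      api_key: os.environ/AZURE_AI_API_KEY
```

Then point api_base in settings.yaml at the proxy (it listens on http://localhost:4000 by default) and keep the OpenAI Chat type.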


pholz commented Feb 13, 2025

Thank you, I got it working with a LiteLLM proxy and a serverless mistral-mini deployment. If anyone else tries this, note that you will probably need to set drop_params: true in the proxy config; otherwise Azure will return errors.
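
In case it helps, the relevant part of my proxy config looked roughly like this (the deployment host and key variable are placeholders):

```yaml
litellm_settings:
  drop_params: true   # drop request parameters the Azure endpoint doesn't accept

model_list:
  - model_name: mistral-mini
    litellm_params:
      model: azure_ai/mistral-mini          # placeholder for the serverless deployment
      api_base: https://<deployment>.<region>.models.ai.azure.com
      api_key: os.environ/AZURE_AI_API_KEY
```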
