[inference] Add support for inference connectors #204541
Conversation
Self-review
```ts
export function isSupportedConnector(connector: RawConnector): connector is RawInferenceConnector {
  if (!isSupportedConnectorType(connector.actionTypeId)) {
    return false;
  }
  if (connector.actionTypeId === InferenceConnectorType.Inference) {
    const config = connector.config ?? {};
    if (config.taskType !== COMPLETION_TASK_TYPE) {
      return false;
    }
  }
  return true;
}
```
Checking whether a connector is compatible is no longer based solely on its type, so I had to create this new check logic.
For inference connectors, we might eventually want to filter based on the provider, but for now I feel like filtering on completion tasks should be sufficient.
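For illustration, a minimal usage sketch of the guard (hypothetical, not code from the PR; the import path is assumed from the `connectors.ts` location mentioned below, and `loadConnectors` is a made-up stand-in for however the raw connector list gets fetched):

```ts
import { isSupportedConnector, type RawConnector } from '@kbn/inference-common';

async function getUsableConnectors(
  loadConnectors: () => Promise<RawConnector[]> // hypothetical fetch helper
) {
  // The type guard narrows each element to RawInferenceConnector, dropping
  // inference connectors configured for a task type other than completion.
  return (await loadConnectors()).filter(isSupportedConnector);
}
```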
moved to x-pack/platform/packages/shared/ai-infra/inference-common/src/connectors.ts
```ts
export const inferenceAdapter: InferenceConnectorAdapter = {
  chatComplete: ({
    executor,
    system,
    messages,
    toolChoice,
    // …
```
The adapter working with the inference connector. It is very similar to the existing openAI adapter, which is why most of the in/out processing logic has been factored out and shared.
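Roughly, the shape looks like this (all types and helpers below are simplified stand-ins, not the PR's actual code):

```ts
// Illustrative sketch only.
type ChatCompleteArgs = {
  executor: { invoke: (params: { body: unknown }) => Promise<unknown> };
  system?: string;
  messages: unknown[];
  toolChoice?: unknown;
};

const adapterSketch = {
  chatComplete: async ({ executor, system, messages, toolChoice }: ChatCompleteArgs) => {
    // 1. Map the plugin's message format to an openAI-compatible request
    //    body (this conversion is the logic shared with the openAI adapter).
    const body = { system, messages, toolChoice };
    // 2. Execute through the connector; the real adapter then converts the
    //    streamed openAI-style chunks back into chatComplete events.
    return executor.invoke({ body });
  },
};
```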
deleted because the o11y assistant is now using the helpers exposed from @kbn/inference-common
```ts
{
  method: 'POST',
  path: `_inference/completion/${this.inferenceId}/_unified`,
  body: { ...params.body, n: undefined }, // exclude n param for now, constant is used on the inference API side
},
{
  asStream: true,
  meta: true,
  signal: params.signal,
}
```
Propagating the abort signal was missing; fixed that.
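A minimal consumer-side sketch of what this fixes (hypothetical call shape, not the plugin's exact API): aborting the controller now cancels the in-flight request instead of being ignored.

```ts
// Hypothetical call shape for illustration; the real API may expose the
// option under a different name.
declare function chatComplete(options: {
  messages: Array<{ role: string; content: string }>;
  signal?: AbortSignal;
}): Promise<unknown>;

const controller = new AbortController();

void chatComplete({
  messages: [{ role: 'user', content: 'Hello!' }],
  signal: controller.signal, // now forwarded all the way to the ES request
});

// Later, e.g. when the user navigates away:
controller.abort();
```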
```ts
// errors should be thrown as it will not be a stream response
if (response.statusCode >= 400) {
  const error = await streamToString(response.body as unknown as Readable);
  throw new Error(error);
}
```
Errors from the API call were not caught and were simply streamed to the consumer, which was fairly problematic.
Fixed by checking the status code and throwing on error codes.
The response format remains unchanged for successful calls (only the streaming body is returned).
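For reference, `streamToString` can be as simple as the following sketch (assuming a utf-8 Node.js Readable; the actual helper in the codebase may differ):

```ts
import type { Readable } from 'stream';

// Sketch of a streamToString helper: buffer all chunks, then decode.
// Assumes utf-8 content; the real helper may differ.
async function streamToString(stream: Readable): Promise<string> {
  const chunks: Buffer[] = [];
  for await (const chunk of stream) {
    chunks.push(Buffer.from(chunk));
  }
  return Buffer.concat(chunks).toString('utf-8');
}
```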
Pinging @elastic/appex-ai-infra (Team:AI Infra)
Approving to unblock from AI Infra side. Formal reviews to be conducted by O11y and Security teams.
Pinging @elastic/obs-ai-assistant (Team:Obs AI Assistant)
Thank you for the changes in the AI Connector! LGTM!
When I do a test run with the inference connector, it works the first time, but if I run it again I get an error.
The Obs AI Assistant is mostly working, except when I ask about alerts or SLOs. This doesn't occur when I use the "regular" OpenAI connector.
This is a known issue, which the ES ML team is working on fixing: elastic/elasticsearch#119000
I believe this one is related to another issue the ML team is tracking: https://github.com/elastic/ml-team/issues/1441
Confirmed that the errors are coming from elastic/elasticsearch#119000, so I'll consider the PR fine to merge.
Starting backport for target branches: 8.x
💚 Build Succeeded
Metrics [docs]: Module Count · Public APIs missing comments · Async chunks · Public APIs missing exports · Page load bundle · Unknown metric groups · API count
💚 All backports created successfully
Note: Successful backport PRs will be merged automatically after passing CI. Questions? Please refer to the Backport tool documentation.
## Summary

~Depends on~ #200249 merged!

Fix #199082

- Add support for the `inference` stack connectors to the `inference` plugin (everything is inference)
- Adapt the o11y assistant to use the `inference-common` utilities for connector filtering / compat checking

## How to test
**1. Start ES with the unified completion feature flag**

```sh
yarn es snapshot --license trial ES_JAVA_OPTS="-Des.inference_unified_feature_flag_enabled=true"
```
**2. Enable the inference connector for Kibana**

In the Kibana config file:

```yaml
xpack.stack_connectors.enableExperimental: ['inferenceConnectorOn']
```
**3. Start Dev Kibana**

```sh
node scripts/kibana --dev --no-base-path
```
**4. Create an inference connector**

Go to `http://localhost:5601/app/management/insightsAndAlerting/triggersActionsConnectors/connectors` and create an inference connector:

- Type: `AI connector`

then

- Service: `OpenAI`
- API Key: Gwzk... Kidding, please ping someone
- Model ID: `gpt-4o`
- Task type: `completion`

-> save
**5. Test the o11y assistant**

Use the assistant as you would for any other connector (just make sure the inference connector is selected as the one being used) and do your testing.