[WIP] Support for Capella-hosted embedding models #68

glennga · 2025-01-30T16:34:43Z

Putting a hold on this -- going to wait for the "init" command PR to be pushed.

TODO:

tests for different sentence transformers models
tests for embedding model use from OpenAI endpoint


TJ202 · 2025-01-31T08:55:04Z

docs/source/env.rst

+``AGENT_CATALOG_EMBEDDING_MODEL_AUTH``
+    The field used in the authorization header of all OpenAI-standard client embedding requests.
+    For embedding models hosted by OpenAI, this field refers to the API key.
+    For embedding models hosted by Capella, this field refers to the Base64-encoded value of


If users provide the database connection username and password, this field would not be required; the request header could be built from the username and password instead.
I assume we don't need the user to obtain this encoded string themselves.

If users provide this field, we will use it (we want to support not just Capella, but all OpenAI-standard endpoints). But yes, we can use the AGENT_CATALOG_USERNAME and AGENT_CATALOG_PASSWORD variables to create this encoded string if AGENT_CATALOG_EMBEDDING_MODEL_AUTH is not set.
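The fallback described in this reply could look roughly like the sketch below. The function name `resolve_embedding_auth` is hypothetical, and the `username:password` layout of the Base64 string is an assumption borrowed from HTTP Basic authentication; the diff above is cut off before it states the actual format.

```python
import base64
import os
from typing import Mapping, Optional


def resolve_embedding_auth(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return the auth value for embedding requests, or None if nothing is configured."""
    # An explicitly set value always wins, so any OpenAI-standard endpoint works.
    explicit = env.get("AGENT_CATALOG_EMBEDDING_MODEL_AUTH")
    if explicit:
        return explicit

    # Otherwise fall back to the database credentials.
    username = env.get("AGENT_CATALOG_USERNAME")
    password = env.get("AGENT_CATALOG_PASSWORD")
    if username and password:
        # ASSUMPTION: "username:password", as in HTTP Basic auth. The real
        # encoding expected by Capella is not stated in this thread.
        raw = f"{username}:{password}".encode("utf-8")
        return base64.b64encode(raw).decode("ascii")
    return None
```

Passing the environment as a mapping keeps the lookup order easy to test without touching the real process environment.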

TJ202 · 2025-01-31T08:59:28Z

libs/agentc_core/agentc_core/catalog/descriptor.py

-        "of each catalog entry.",
-        examples=["sentence-transformers/all-MiniLM-L12-v2"],
+    embedding_model: EmbeddingModel = pydantic.Field(
+        description="Embedding model used for all descriptions in the catalog.",


Suggested change

-        description="Embedding model used for all descriptions in the catalog.",
+        description="Embedding model used to generate embedding for tool/prompt descriptions to store in the catalogs.",

Something like this would be more suitable.

TJ202 · 2025-01-31T09:00:43Z

libs/agentc_core/agentc_core/learned/embedding.py


            # Grab our local tool embedding model...
            local_tool_catalog_path = self.catalog_path / DEFAULT_TOOL_CATALOG_NAME
            if local_tool_catalog_path.exists():
                with local_tool_catalog_path.open("r") as fp:
                    local_tool_catalog = CatalogDescriptor.model_validate_json(fp.read())
-                collected_embedding_model_names.add(local_tool_catalog.embedding_model)
+                collected_embedding_models.add(local_tool_catalog.embedding_model)

            # ...and now our local prompt embedding model.


Do users have the option to specify different models for tools and prompts?
If I'm not wrong, AGENT_CATALOG_EMBEDDING_MODEL defines the embedding model used, and it applies to both prompts and tools.

They shouldn't, but this is the constraint we have now (because right now we have two metadata collections). In the future, we want to remove this.
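One way to enforce the "one model for both catalogs" constraint while the two metadata collections still exist is sketched below. `EmbeddingModelStub` is a hypothetical stand-in for the PR's pydantic `EmbeddingModel` (a frozen dataclass is used here only so that instances can be placed in a set), and `single_embedding_model` is an illustrative name, not a function from the PR.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class EmbeddingModelStub:
    """Hypothetical stand-in for EmbeddingModel; frozen so instances are hashable."""
    kind: str
    name: str
    base_url: Optional[str] = None


def single_embedding_model(models: Iterable[EmbeddingModelStub]) -> Optional[EmbeddingModelStub]:
    """Return the one model shared by all catalogs, or None if no catalog exists yet."""
    collected = set(models)  # identical models from tool and prompt catalogs collapse
    if len(collected) > 1:
        names = sorted(m.name for m in collected)
        raise ValueError(f"Tool and prompt catalogs disagree on the embedding model: {names}")
    return next(iter(collected), None)
```

Collecting into a set mirrors the `collected_embedding_models.add(...)` calls in the diff, which only work if the model objects are hashable.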

TJ202 · 2025-01-31T09:04:18Z

docs/source/env.rst

    The embedding model that Agent Catalog will use when indexing and querying tools and prompts.
    This *must* be a valid embedding model that is supported by the :python:`sentence_transformers.SentenceTransformer`
-    class.
+    class *or* the name of a model that can be used from the endpoint specified in the environment variable
+    ``AGENT_CATALOG_EMBEDDING_MODEL_URL``.


Can we specify the allowed models for OpenAI in the docs?
What values can this variable and the URL take in order to hit the /endpoint route?

There aren't really allowed models; it's whatever the model service provides (given that the model supports function calling, etc.).
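To make the question about values concrete, here is a minimal sketch of how the three settings could map onto an OpenAI-standard embeddings request. It only builds the request and does not send it. The helper name `build_embedding_request` is hypothetical, the `Bearer` scheme is what OpenAI itself uses, and whether a Capella endpoint expects the same scheme is an assumption this thread does not settle.

```python
import json
import urllib.request


def build_embedding_request(base_url: str, model: str, auth: str, text: str) -> urllib.request.Request:
    """Build (but do not send) a POST to the OpenAI-style /embeddings route."""
    # base_url plays the role of AGENT_CATALOG_EMBEDDING_MODEL_URL,
    # e.g. "https://api.openai.com/v1" for OpenAI itself.
    url = base_url.rstrip("/") + "/embeddings"
    # model plays the role of AGENT_CATALOG_EMBEDDING_MODEL: any name the service offers.
    body = json.dumps({"model": model, "input": text}).encode("utf-8")
    headers = {
        "Content-Type": "application/json",
        # ASSUMPTION: Bearer scheme, as used by OpenAI; other hosts may differ.
        "Authorization": f"Bearer {auth}",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")
```

The model name is passed through untouched, which matches the point above that the service, not Agent Catalog, decides which names are valid.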

TJ202 · 2025-01-31T09:05:50Z

libs/agentc_core/agentc_core/learned/model.py

+    )
+    base_url: typing.Optional[str] = pydantic.Field(
+        description="The base URL of the embedding model."
+        "This field must be specified is using a non-SentenceTransformers-based model.",


Suggested change

-        "This field must be specified is using a non-SentenceTransformers-based model.",
+        "This field must be specified if using a non-SentenceTransformers-based model.",

TJ202 · 2025-01-31T09:06:31Z

libs/agentc_core/agentc_core/learned/model.py

+
+
+class EmbeddingModel(pydantic.BaseModel):
+    kind: typing.Literal["sentence-transformers", "openai"] = pydantic.Field(


For Capella models, what is the kind?

openai (because Capella is supposed to be OpenAI-compliant)

TJ202 · 2025-01-31T09:07:12Z

libs/agentc_core/agentc_core/learned/model.py

+    )
+    name: str = pydantic.Field(
+        description="The name of the embedding model being used.",
+        examples=["all-MiniLM-L12-v2", "https://12fs345d.apps.cloud.couchbase.com"],


The name of the model, if it's on Capella, is intfloat/e5-mistral-7b-instruct.
Last I checked, this is the only embedding model they have hosted.

https://12fs345d.apps.cloud.couchbase.com is more of the endpoint FQDN URL, if I'm not wrong.

yep, this name example is wrong 😅
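Putting the review points on `kind`, `name`, and `base_url` together, the intended shape might be sketched as follows. This is a plain-dataclass stand-in rather than the PR's pydantic model; `EmbeddingModelSketch` is a hypothetical name, and the check that rejects a URL in `name` is an illustrative extra that nobody in this thread proposed.

```python
from dataclasses import dataclass
from typing import Literal, Optional


@dataclass(frozen=True)
class EmbeddingModelSketch:
    """Hypothetical stand-in for EmbeddingModel, with the review feedback applied."""
    kind: Literal["sentence-transformers", "openai"]
    name: str
    base_url: Optional[str] = None

    def __post_init__(self) -> None:
        # Capella is treated as kind="openai", so it needs a base URL too.
        if self.kind == "openai" and not self.base_url:
            raise ValueError(
                "base_url must be specified if using a non-SentenceTransformers-based model."
            )
        # Illustrative guard (not from the PR): an endpoint URL is not a model name.
        if self.name.startswith(("http://", "https://")):
            raise ValueError("name should be a model name, not an endpoint URL.")


# A local model needs no endpoint.
local = EmbeddingModelSketch(kind="sentence-transformers", name="all-MiniLM-L12-v2")

# A Capella-hosted model: the model name goes in `name`, the FQDN in `base_url`.
capella = EmbeddingModelSketch(
    kind="openai",
    name="intfloat/e5-mistral-7b-instruct",
    base_url="https://12fs345d.apps.cloud.couchbase.com",
)
```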

ThejasNU · 2025-02-04T05:21:41Z

Putting a hold on this -- going to wait for the "init" command PR to be pushed.

It would be better if you merged this into the dev branch instead of master.

Putting a hold on this -- going to wait for the "init" command PR to …

c1eaac0

…be pushed. TODO: [] - tests for different sentence transformers models [] - tests for embedding model use from OpenAI endpoint

glennga added the "enhancement" (New feature or request) label Jan 30, 2025

glennga self-assigned this Jan 30, 2025

TJ202 reviewed Jan 31, 2025



