core: allow artifact in create_retriever_tool #28903

ianchi · 2024-12-24T12:16:27Z

Add option to return content and artifacts, to also be able to access the full info of the retrieved documents.

They are returned as a list of dicts in the artifacts property if parameter response_format is set to "content_and_artifact".

Defaults to "content" to keep current behavior.

vercel · 2024-12-24T12:16:31Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Skipped Deployment

Name	Status	Preview	Comments	Updated (UTC)
langchain	⬜️ Ignored (Inspect)	Visit Preview		Dec 24, 2024 4:00pm

efriis

2 questions/suggestions, and will need some unit tests to be mergable

efriis · 2024-12-24T20:45:57Z

libs/core/langchain_core/tools/retriever.py

        [await aformat_document(doc, document_prompt) for doc in docs]
    )

+    if response_format == "content_and_artifact":
+        return (content, [doc.model_dump() for doc in docs])


might make sense to just return docs as the artifact - any reason to dump here in particular?

The reason is to always have the same type after deserialization.

As artifact has type Any when deserializing a ToolMessage from json or any other source (api call, DB, etc), Pydantic won't be able to do any "magic" to regenerate the artifact as a list[Document] but it will be a plain list[dict].

So, to be consistent and always have a dict I opted to dump.
Otherwise you need to add custom logic when deserializing to have consistent types.

efriis · 2024-12-24T20:46:16Z

libs/core/langchain_core/tools/retriever.py

        format_document(doc, document_prompt) for doc in docs
    )
+    if response_format == "content_and_artifact":
+        return (content, [doc.model_dump() for doc in docs])


same q as on async one

Same reasoning as above

efriis · 2024-12-24T20:47:20Z

libs/core/langchain_core/tools/retriever.py

@@ -55,6 +66,7 @@ def create_retriever_tool(
    *,
    document_prompt: Optional[BasePromptTemplate] = None,
    document_separator: str = "\n\n",
+    response_format: Literal["content", "content_and_artifact"] = "content",


needs to be added to docstring

dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Dec 24, 2024

dosubot bot added Ɑ: core Related to langchain-core Ɑ: retriever Related to retriever module labels Dec 24, 2024

core: allow artifact in create_retriever_tool

c04c85d

ianchi force-pushed the retriever_tool branch from eef8b40 to c04c85d Compare December 24, 2024 13:15

core: retriever_tool fix linting

458d538

efriis reviewed Dec 24, 2024

View reviewed changes

efriis self-assigned this Dec 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core: allow artifact in create_retriever_tool #28903

core: allow artifact in create_retriever_tool #28903

ianchi commented Dec 24, 2024

vercel bot commented Dec 24, 2024 •

edited

Loading

efriis left a comment

efriis Dec 24, 2024

ianchi Dec 24, 2024

efriis Dec 24, 2024

ianchi Dec 24, 2024

efriis Dec 24, 2024

core: allow artifact in create_retriever_tool #28903

Are you sure you want to change the base?

core: allow artifact in create_retriever_tool #28903

Conversation

ianchi commented Dec 24, 2024

vercel bot commented Dec 24, 2024 • edited Loading

efriis left a comment

Choose a reason for hiding this comment

efriis Dec 24, 2024

Choose a reason for hiding this comment

ianchi Dec 24, 2024

Choose a reason for hiding this comment

efriis Dec 24, 2024

Choose a reason for hiding this comment

ianchi Dec 24, 2024

Choose a reason for hiding this comment

efriis Dec 24, 2024

Choose a reason for hiding this comment

vercel bot commented Dec 24, 2024 •

edited

Loading