fix(langchain): handle secret str api keys #7430

albertjan · 2023-10-31T11:43:33Z

Currently the anthropic chain implementation in langchain uses a pydantic SecretStr as an api key this is causing errors in our pipeline when ddtrace tries to format the api key.

With this PR: langchain-ai/langchain#12542 the OpenAI implementation will also start using a SecretStr. I'm sure at that point there will be a few more people asking why things are broken.

I'm struggling setting up and running the tests, riot doesn't print anything. And I have no experience with the cassettes testing methods. Can someone help with this? I think if we add a test that uses the Anthropic LLM we will see the failure before. And this will fix it.

I've updated the type comment to the function, but the env doesn't know about Pydantic so I don't know if this is a valid thing to do.

Checklist

Change(s) are motivated and described in the PR description.
Testing strategy is described if automated tests are not included in the PR.
Risk is outlined (performance impact, potential for breakage, maintainability, etc).
Change is maintainable (easy to change, telemetry, documentation).
Library release note guidelines are followed. If no release note is required, add label changelog/no-changelog.
Documentation is included (in-code, generated user docs, public corp docs).
Backport labels are set (if applicable)

Reviewer Checklist

…them

Yun-Kim · 2023-10-31T15:25:15Z

Hi @albertjan, thanks for your contribution! I can take over testing and formatting and getting this merged.

Yun-Kim · 2023-10-31T19:20:37Z

The existing test cases should be sufficient since we continue to test latest versions of langchain - we'll be receiving SecretStr api keys (already are for AI21) once langchain continues to make these api key changes. Thanks again @albertjan for your contribution!

albertjan · 2023-10-31T22:03:00Z

You're welcome @Yun-Kim ☺️.

The existing test cases should be sufficient since we continue to test latest versions of langchain

A test using the ChatAnthropic would already exercise the SecretStr bit of the _format_api_key function. But yeah the tests will eventually exercise it. 👍

majorgreys

Do the existing tests cover the fixed case? I don't see any change to a span tag check.

ZStriker19 · 2023-11-02T16:42:28Z

Agreeing with Tahir, can we add a test for span.set_tag_str(API_KEY, _format_api_key(api_key)) # override api_key for Pinecone to verify the tag is changed as we expect?

Yun-Kim · 2023-11-02T17:28:21Z

@majorgreys @ZStriker19 the existing test cases cover this fixed case - langchain is in the process of switching over to use SecretStr api keys and already has been using SecretStr keys for the AI21 provider, which we already have a test case for.

can we add a test for span.set_tag_str(API_KEY, _format_api_key(api_key)) # override api_key for Pinecone to verify the tag is changed as we expect?

That change is just a refactor, no functionality change as we're not getting that information through langchain.

ddtrace/contrib/langchain/patch.py

Currently the anthropic chain implementation in langchain uses a pydantic SecretStr as an api key this is causing errors in our pipeline when ddtrace tries to format the api key. With this PR: langchain-ai/langchain#12542 the OpenAI implementation will also start using a SecretStr. I'm sure at that point there will be a few more people asking why things are broken. I'm struggling setting up and running the tests, riot doesn't print anything. And I have no experience with the cassettes testing methods. Can someone help with this? I think if we add a test that uses the Anthropic LLM we will see the failure before. And this will fix it. I've updated the type comment to the function, but the env doesn't know about Pydantic so I don't know if this is a valid thing to do. ## Checklist - [X] Change(s) are motivated and described in the PR description. - [x] Testing strategy is described if automated tests are not included in the PR. - [X] Risk is outlined (performance impact, potential for breakage, maintainability, etc). - [X] Change is maintainable (easy to change, telemetry, documentation). - [X] [Library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) are followed. If no release note is required, add label `changelog/no-changelog`. - [X] Documentation is included (in-code, generated user docs, [public corp docs](https://github.com/DataDog/documentation/)). - [x] Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Title is accurate. - [x] No unnecessary changes are introduced. - [x] Description motivates each change. - [x] Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes unless absolutely necessary. - [x] Testing strategy adequately addresses listed risk(s). - [x] Change is maintainable (easy to change, telemetry, documentation). - [x] Release note makes sense to a user of the library. - [x] Reviewer has explicitly acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment. - [x] Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting) - [x] If this PR touches code that signs or publishes builds or packages, or handles credentials of any kind, I've requested a review from `@DataDog/security-design-and-guidance`. - [x] This PR doesn't touch any of that. --------- Co-authored-by: Yun Kim <[email protected]> Co-authored-by: Yun Kim <[email protected]> (cherry picked from commit 6dc61f5)

albertjan · 2023-11-03T22:07:15Z

Thanks all! 😄

Backport 6dc61f5 from #7430 to 1.20. Currently the anthropic chain implementation in langchain uses a pydantic SecretStr as an api key this is causing errors in our pipeline when ddtrace tries to format the api key. With this PR: langchain-ai/langchain#12542 the OpenAI implementation will also start using a SecretStr. I'm sure at that point there will be a few more people asking why things are broken. I'm struggling setting up and running the tests, riot doesn't print anything. And I have no experience with the cassettes testing methods. Can someone help with this? I think if we add a test that uses the Anthropic LLM we will see the failure before. And this will fix it. I've updated the type comment to the function, but the env doesn't know about Pydantic so I don't know if this is a valid thing to do. ## Checklist - [X] Change(s) are motivated and described in the PR description. - [x] Testing strategy is described if automated tests are not included in the PR. - [X] Risk is outlined (performance impact, potential for breakage, maintainability, etc). - [X] Change is maintainable (easy to change, telemetry, documentation). - [X] [Library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) are followed. If no release note is required, add label `changelog/no-changelog`. - [X] Documentation is included (in-code, generated user docs, [public corp docs](https://github.com/DataDog/documentation/)). - [x] Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Title is accurate. - [x] No unnecessary changes are introduced. - [x] Description motivates each change. - [x] Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes unless absolutely necessary. - [x] Testing strategy adequately addresses listed risk(s). - [x] Change is maintainable (easy to change, telemetry, documentation). - [x] Release note makes sense to a user of the library. - [x] Reviewer has explicitly acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment. - [x] Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting) - [x] If this PR touches code that signs or publishes builds or packages, or handles credentials of any kind, I've requested a review from `@DataDog/security-design-and-guidance`. - [x] This PR doesn't touch any of that. --------- Co-authored-by: Albert-Jan Nijburg <[email protected]> Co-authored-by: Yun Kim <[email protected]>

Backport 6dc61f5 from #7430 to 2.1. Currently the anthropic chain implementation in langchain uses a pydantic SecretStr as an api key this is causing errors in our pipeline when ddtrace tries to format the api key. With this PR: langchain-ai/langchain#12542 the OpenAI implementation will also start using a SecretStr. I'm sure at that point there will be a few more people asking why things are broken. I'm struggling setting up and running the tests, riot doesn't print anything. And I have no experience with the cassettes testing methods. Can someone help with this? I think if we add a test that uses the Anthropic LLM we will see the failure before. And this will fix it. I've updated the type comment to the function, but the env doesn't know about Pydantic so I don't know if this is a valid thing to do. ## Checklist - [X] Change(s) are motivated and described in the PR description. - [x] Testing strategy is described if automated tests are not included in the PR. - [X] Risk is outlined (performance impact, potential for breakage, maintainability, etc). - [X] Change is maintainable (easy to change, telemetry, documentation). - [X] [Library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) are followed. If no release note is required, add label `changelog/no-changelog`. - [X] Documentation is included (in-code, generated user docs, [public corp docs](https://github.com/DataDog/documentation/)). - [x] Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Title is accurate. - [x] No unnecessary changes are introduced. - [x] Description motivates each change. - [x] Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes unless absolutely necessary. - [x] Testing strategy adequately addresses listed risk(s). - [x] Change is maintainable (easy to change, telemetry, documentation). - [x] Release note makes sense to a user of the library. - [x] Reviewer has explicitly acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment. - [x] Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting) - [x] If this PR touches code that signs or publishes builds or packages, or handles credentials of any kind, I've requested a review from `@DataDog/security-design-and-guidance`. - [x] This PR doesn't touch any of that. --------- Co-authored-by: Albert-Jan Nijburg <[email protected]> Co-authored-by: Yun Kim <[email protected]>

Backport 6dc61f5 from #7430 to 2.0. Currently the anthropic chain implementation in langchain uses a pydantic SecretStr as an api key this is causing errors in our pipeline when ddtrace tries to format the api key. With this PR: langchain-ai/langchain#12542 the OpenAI implementation will also start using a SecretStr. I'm sure at that point there will be a few more people asking why things are broken. I'm struggling setting up and running the tests, riot doesn't print anything. And I have no experience with the cassettes testing methods. Can someone help with this? I think if we add a test that uses the Anthropic LLM we will see the failure before. And this will fix it. I've updated the type comment to the function, but the env doesn't know about Pydantic so I don't know if this is a valid thing to do. ## Checklist - [X] Change(s) are motivated and described in the PR description. - [x] Testing strategy is described if automated tests are not included in the PR. - [X] Risk is outlined (performance impact, potential for breakage, maintainability, etc). - [X] Change is maintainable (easy to change, telemetry, documentation). - [X] [Library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) are followed. If no release note is required, add label `changelog/no-changelog`. - [X] Documentation is included (in-code, generated user docs, [public corp docs](https://github.com/DataDog/documentation/)). - [x] Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Title is accurate. - [x] No unnecessary changes are introduced. - [x] Description motivates each change. - [x] Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes unless absolutely necessary. - [x] Testing strategy adequately addresses listed risk(s). - [x] Change is maintainable (easy to change, telemetry, documentation). - [x] Release note makes sense to a user of the library. - [x] Reviewer has explicitly acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment. - [x] Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting) - [x] If this PR touches code that signs or publishes builds or packages, or handles credentials of any kind, I've requested a review from `@DataDog/security-design-and-guidance`. - [x] This PR doesn't touch any of that. --------- Co-authored-by: Albert-Jan Nijburg <[email protected]> Co-authored-by: Yun Kim <[email protected]> Co-authored-by: Yun Kim <[email protected]>

albertjan added 2 commits October 31, 2023 11:03

fix(langchain): Correctly handle SecretStr api_keys while formatting …

be67f92

…them

release note

13db8e8

albertjan requested review from a team as code owners October 31, 2023 11:43

albertjan requested review from majorgreys, Yun-Kim, ZStriker19 and tabgok October 31, 2023 11:43

Merge branch '2.x' into fix/handling_of_secret_str_api_keys

032c329

Yun-Kim and others added 2 commits October 31, 2023 15:12

Fix formatting, release note

b8c6553

Merge branch '2.x' into fix/handling_of_secret_str_api_keys

2733ce8

Yun-Kim changed the title ~~fix(langchain): Correctly handle SecretStr api_keys while formatting~~ fix(langchain): handle secret str api keys Oct 31, 2023

Yun-Kim added backport 1.20 backport 2.0 labels Oct 31, 2023

Yun-Kim approved these changes Oct 31, 2023

View reviewed changes

Yun-Kim enabled auto-merge (squash) October 31, 2023 19:37

majorgreys reviewed Nov 1, 2023

View reviewed changes

Yun-Kim closed this Nov 2, 2023

auto-merge was automatically disabled November 2, 2023 18:07
Pull request was closed

Yun-Kim reopened this Nov 2, 2023

Yun-Kim enabled auto-merge (squash) November 2, 2023 18:08

Yun-Kim and others added 2 commits November 2, 2023 14:08

Merge branch '2.x' into fix/handling_of_secret_str_api_keys

1f0d4c0

Merge branch '2.x' into fix/handling_of_secret_str_api_keys

caf44d6

Yun-Kim requested a review from mabdinur November 3, 2023 15:08

Merge branch '2.x' into fix/handling_of_secret_str_api_keys

dcf1e34

Yun-Kim reviewed Nov 3, 2023

View reviewed changes

ddtrace/contrib/langchain/patch.py Show resolved Hide resolved

ZStriker19 approved these changes Nov 3, 2023

View reviewed changes

Yun-Kim added 2 commits November 3, 2023 15:39

Merge branch '2.x' into fix/handling_of_secret_str_api_keys

e9331c2

Merge branch '2.x' into fix/handling_of_secret_str_api_keys

2ab4d13

Yun-Kim merged commit 6dc61f5 into DataDog:2.x Nov 3, 2023

github-actions bot mentioned this pull request Nov 3, 2023

fix(langchain): handle secret str api keys [backport 1.20] #7478

Merged

18 tasks

github-actions bot mentioned this pull request Nov 3, 2023

fix(langchain): handle secret str api keys [backport 2.0] #7479

Merged

18 tasks

github-actions bot mentioned this pull request Nov 3, 2023

fix(langchain): handle secret str api keys [backport 2.1] #7480

Merged

18 tasks

albertjan deleted the fix/handling_of_secret_str_api_keys branch November 4, 2023 07:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(langchain): handle secret str api keys #7430

fix(langchain): handle secret str api keys #7430

albertjan commented Oct 31, 2023 •

edited by Yun-Kim

Loading

Yun-Kim commented Oct 31, 2023

Yun-Kim commented Oct 31, 2023

albertjan commented Oct 31, 2023 •

edited

Loading

majorgreys left a comment

ZStriker19 commented Nov 2, 2023

Yun-Kim commented Nov 2, 2023

albertjan commented Nov 3, 2023

fix(langchain): handle secret str api keys #7430

fix(langchain): handle secret str api keys #7430

Conversation

albertjan commented Oct 31, 2023 • edited by Yun-Kim Loading

Checklist

Reviewer Checklist

Yun-Kim commented Oct 31, 2023

Yun-Kim commented Oct 31, 2023

albertjan commented Oct 31, 2023 • edited Loading

majorgreys left a comment

Choose a reason for hiding this comment

ZStriker19 commented Nov 2, 2023

Yun-Kim commented Nov 2, 2023

albertjan commented Nov 3, 2023

albertjan commented Oct 31, 2023 •

edited by Yun-Kim

Loading

albertjan commented Oct 31, 2023 •

edited

Loading