feat(google-genai): Context Caching #7169

kwei-zhang · 2024-11-07T11:44:22Z

Implemented context caching for google-genai. Users can now cache a file and create a generative AI model based on the cached content.

kwei-zhang · 2024-11-13T00:16:06Z

Hi @jacoblee93, we've implemented the foundational structure for context caching with files. Could you take a look and let us know if the code aligns with our intended design? Thank you!

jacoblee93 · 2024-11-17T00:07:51Z

Looks reasonable to me - @afirstenberg can you have a look? It looks similar to some work you've done on Vertex.

We will also want to write up some docs!

libs/langchain-google-genai/src/tests/context_caching.int.test.ts

chaunguyenm · 2024-11-18T22:11:45Z

> Looks reasonable to me - @afirstenberg can you have a look? It looks similar to some work you've done on Vertex.
>
> We will also want to write up some docs!

We will work on the docs now, but please let us know if you have any suggestions on the design or the tests. Thank you both!

chaunguyenm · 2024-11-19T17:32:33Z

@jacoblee93 @afirstenberg We currently add a wrapper around GoogleAIFileManager and GoogleAICacheManager to support context caching, but the wrapper doesn't provide any additional functionality. We added it because we are not sure whether we can rely on users installing and using the @google/generative-ai/server package directly. Do you have a suggestion on how we should structure this? Thanks a lot.
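To make the design question concrete, here is a minimal sketch of what a pass-through wrapper looks like. `CacheManagerLike`, `StubCacheManager`, and `PassThroughCacheManager` are hypothetical names written for this illustration; they are not the classes from this PR and they do not reproduce the real GoogleAICacheManager API (the stub is synchronous for brevity, whereas real manager calls are asynchronous).

```typescript
// Hypothetical stand-in for the small part of a cache manager used here.
// This is NOT the real GoogleAICacheManager interface.
interface CacheManagerLike {
  create(displayName: string): string;
}

// Stub manager that returns a made-up cache name for a display name.
class StubCacheManager {
  create(displayName: string): string {
    return "cachedContents/" + displayName;
  }
}

// A pass-through wrapper: every method only delegates to the wrapped manager,
// so callers get exactly what the underlying manager would have returned.
class PassThroughCacheManager {
  inner: CacheManagerLike;

  constructor(inner: CacheManagerLike) {
    this.inner = inner;
  }

  create(displayName: string): string {
    return this.inner.create(displayName);
  }
}

// Calls the stub directly and through the wrapper so the results can be compared.
function createBothWays(displayName: string): { direct: string; wrapped: string } {
  const manager = new StubCacheManager();
  const wrapper = new PassThroughCacheManager(manager);
  return {
    direct: manager.create(displayName),
    wrapped: wrapper.create(displayName),
  };
}
```

In this sketch both paths return the same value, so the wrapper's only possible benefit is hiding the underlying dependency from callers, which is the trade-off the question above is about.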

jacoblee93 · 2024-12-03T22:17:08Z

libs/langchain-google-genai/src/tests/context_caching.int.test.ts

+
+  const filename = fileURLToPath(import.meta.url);
+  const dirname = path.dirname(filename);
+  const pathToVideoFile = path.join(dirname, "/data/Sherlock_Jr_FullMovie.mp4");


Can we use a smaller file for this test? Maybe one of the files that already exist in the repo?

jacoblee93 · 2024-12-04T01:58:18Z

Thank you! Removed the intermediate class you added in favor of just the raw managers

It wasn't exported anyway

jacoblee93 · 2024-12-04T03:15:06Z

Also renamed to useCachedContent

Thanks again, will go live shortly!
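For readers following the thread, the flow discussed here (upload a file, create cached content from it, then point the model at the cache) can be sketched roughly as below. Only the method name `useCachedContent` comes from this conversation. Every class, field, and signature in the block (`StubFileManager`, `StubCacheManager`, `StubChatModel`, `buildRequest`, and so on) is a hypothetical stand-in so the example is self-contained, and the calls are synchronous for brevity. Consult the package documentation for the real API.

```typescript
// Hypothetical, simplified stand-ins. These are NOT the real classes from
// @google/generative-ai/server or @langchain/google-genai.
interface UploadedFile {
  uri: string;
  mimeType: string;
  sourcePath: string;
}

interface CachedContent {
  name: string;
  fileUri: string;
  ttlSeconds: number;
}

interface ModelRequest {
  prompt: string;
  cachedContent?: string;
}

// Pretends to upload a file and returns a made-up URI for it.
class StubFileManager {
  count: number = 0;

  uploadFile(path: string, mimeType: string): UploadedFile {
    this.count += 1;
    return { uri: "files/" + this.count, mimeType: mimeType, sourcePath: path };
  }
}

// Pretends to create cached content that refers to an uploaded file.
class StubCacheManager {
  count: number = 0;

  create(file: UploadedFile, ttlSeconds: number): CachedContent {
    this.count += 1;
    return {
      name: "cachedContents/" + this.count,
      fileUri: file.uri,
      ttlSeconds: ttlSeconds,
    };
  }
}

// Model stand-in: once a cache is attached, requests reference it by name
// instead of carrying the file again (an assumption of this sketch).
class StubChatModel {
  cached: CachedContent | undefined = undefined;

  useCachedContent(cached: CachedContent): void {
    this.cached = cached;
  }

  buildRequest(prompt: string): ModelRequest {
    const request: ModelRequest = { prompt: prompt };
    if (this.cached !== undefined) {
      request.cachedContent = this.cached.name;
    }
    return request;
  }
}

// Walks through the three steps: upload, create cache, attach to model.
function buildCachedRequestDemo(): ModelRequest {
  const fileManager = new StubFileManager();
  const cacheManager = new StubCacheManager();
  const uploaded = fileManager.uploadFile("/path/to/video.mp4", "video/mp4");
  const cached = cacheManager.create(uploaded, 300);
  const model = new StubChatModel();
  model.useCachedContent(cached);
  return model.buildRequest("Summarize the video.");
}
```

The point of the shape is that the managers stay separate from the model: the model only needs the handle returned by the cache manager, which matches the decision above to drop the intermediate class.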

Co-authored-by: Chau Nguyen <[email protected]> Co-authored-by: jacoblee93 <[email protected]>

dosubot bot added the size:L label (This PR changes 100-499 lines, ignoring generated files.) Nov 7, 2024

dosubot bot added the auto:improvement label (Medium size change to existing code to handle new use-cases) Nov 7, 2024

kwei-zhang marked this pull request as draft November 7, 2024 11:45

vercel bot deployed to Preview – langchainjs-docs November 7, 2024 11:53 View deployment

kwei-zhang force-pushed the google-genai-context-caching branch from 8e039e2 to b065a2f Compare November 7, 2024 23:26

vercel bot deployed to Preview – langchainjs-docs November 7, 2024 23:35 View deployment

kwei-zhang marked this pull request as ready for review November 13, 2024 17:30

Chau Nguyen added 4 commits November 13, 2024 12:30

update google-genai version and add method to create model from cache

5146e14

added test

4348cda

fixed test

a447435

lint and format

6658acb

kwei-zhang force-pushed the google-genai-context-caching branch from b065a2f to 6658acb Compare November 13, 2024 17:30

vercel bot deployed to Preview – langchainjs-docs November 13, 2024 17:43 View deployment

jacoblee93 reviewed Nov 17, 2024

libs/langchain-google-genai/src/tests/context_caching.int.test.ts

jacoblee93 added the close label (PRs that need one or two touch-ups to be ready) Nov 17, 2024

added doc

99c7b9b

vercel bot deployed to Preview – langchainjs-docs November 19, 2024 18:14 View deployment

jacoblee93 reviewed Dec 3, 2024

Merge

8a0b33b

vercel bot deployed to Preview – langchainjs-docs December 4, 2024 01:46 View deployment

Remove unexported code

4d88f0a

jacoblee93 removed the close label (PRs that need one or two touch-ups to be ready) Dec 4, 2024

jacoblee93 approved these changes Dec 4, 2024

dosubot bot added the lgtm label (PRs that are ready to be merged as-is) Dec 4, 2024

jacoblee93 changed the title from "google-genai [feature]: Context Caching" to "feat(google-genai): Context Caching" Dec 4, 2024

vercel bot deployed to Preview – langchainjs-docs December 4, 2024 02:09 View deployment

Fix notebook

a4ebc16

vercel bot deployed to Preview – langchainjs-docs December 4, 2024 03:09 View deployment

jacoblee93 merged commit 5f62174 into langchain-ai:main Dec 4, 2024
28 checks passed

syntaxsec pushed a commit to aks-456/langchainjs that referenced this pull request Dec 13, 2024

feat(google-genai): Context Caching (langchain-ai#7169)

16e3d54

Co-authored-by: Chau Nguyen <[email protected]> Co-authored-by: jacoblee93 <[email protected]>
