
core: Use Blockbuster to detect blocking calls in asyncio during tests #29043

Open
cbornet wants to merge 4 commits into master from the blockbuster branch

Conversation

cbornet (Collaborator) commented Jan 6, 2025:

This PR uses the blockbuster library in langchain-core to detect blocking calls made in the asyncio event loop during unit tests.
Avoiding blocking calls is hard because they can be deeply buried in the code or made in third-party libraries.
Blockbuster makes them easier to detect by raising an exception whenever a known blocking function (e.g. time.sleep) is called while the event loop is running.

Adding Blockbuster allowed us to find a blocking call in aconfig_with_context (it ends up calling get_function_nonlocals, which loads the function's source code).
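For illustration, wiring this into pytest looks roughly like the sketch below (a minimal autouse fixture based on Blockbuster's blockbuster_ctx context manager; a simplified sketch, not the exact conftest.py added by this PR):

import pytest
from blockbuster import blockbuster_ctx

@pytest.fixture(autouse=True)
def blockbuster():
    # While the fixture is active, calling a known blocking function
    # (e.g. time.sleep or file I/O) from inside the running event loop
    # raises an exception instead of silently stalling the loop.
    with blockbuster_ctx() as bb:
        yield bb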

Dependencies:

  • blockbuster (test)

Twitter handle: cbornet_


cbornet force-pushed the blockbuster branch 2 times, most recently from 0983c3d to 7ce4b18 (January 6, 2025 15:10)

with tempfile.NamedTemporaryFile(delete=True, suffix=".jpg") as temp_file:
cbornet (Collaborator, Author) commented:
It's useless to use a real file here.

"""Test invoking nested runnable lambda."""
blockbuster.deactivate()
cbornet (Collaborator, Author) commented:
The code makes a sync call from async on purpose...

eyurtsev (Collaborator) left a comment:

Looks good overall. Should we enable/disable for unit tests?

I like that it forces us to separate async tests from sync tests and avoid being lazy, but at the same time there are some changes that aren't technically necessary and seem to unnecessarily complicate the testing code (e.g., updating a file open to a non-blocking one)?

@@ -121,7 +121,7 @@ def _config_with_context(
     return patch_config(config, configurable=context_funcs)
 
 
-def aconfig_with_context(
+async def aconfig_with_context(
eyurtsev (Collaborator) commented:

Technically a breaking change, but probably okay; it looks a lot like a private function to me.

Would you mind adding a comment about why this needs to be async (i.e., is inspect.getsource making OS calls)?

cbornet (Collaborator, Author) commented Jan 13, 2025:

Yes, inspect.getsource makes os.stat calls to check whether a source file has changed since it was cached in linecache, and filesystem read calls to fetch the code if the linecache entry needs to be updated (done at least on first access).
Note that an LRU cache was added (#28131), probably because these OS calls have an impact on perf?
Thinking about it, it may be cleaner to have an aconfig_specs in Runnable that defaults to calling config_specs (not in a thread) and that we can override to use a thread for RunnableLambda. WDYT?
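For illustration, the shape of the change under discussion is roughly the following (a simplified sketch, not the exact PR diff; it assumes _config_with_context takes the config and the steps as in the hunk above): the introspection that ends up in inspect.getsource is pushed off the event loop with asyncio.to_thread.

import asyncio

async def aconfig_with_context(config, steps):
    # _config_with_context walks the steps' config specs, which for a
    # RunnableLambda ends up in get_function_nonlocals / inspect.getsource
    # and therefore performs os.stat and file reads; run it in a worker
    # thread so the event loop is never blocked.
    return await asyncio.to_thread(_config_with_context, config, steps)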

@@ -90,9 +91,11 @@ async def test_inmemory_dump_load(tmp_path: Path) -> None:
     output = await store.asimilarity_search("foo", k=1)
 
     test_file = str(tmp_path / "test.json")
-    store.dump(test_file)
+    await asyncio.to_thread(store.dump, test_file)
eyurtsev (Collaborator) commented:

Feels like a false positive for the test? It doesn't really matter whether this uses blocking or non-blocking code here.

cbornet (Collaborator, Author) commented:

Well, with the way Blockbuster is configured here (activated before the test, deactivated after), the test code itself needs to be non-blocking.
Maybe we could add aload/adump methods to InMemoryVectorStore using aiofile/aiofiles?
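For illustration, an adump along those lines could look roughly like this (a minimal sketch assuming aiofiles and a hypothetical _dumps helper that serializes the store to a JSON string; not the actual InMemoryVectorStore API):

import aiofiles

async def adump(self, path: str) -> None:
    # Serialize in memory, then write the file without blocking the event loop.
    data = self._dumps()  # hypothetical helper returning a JSON string
    async with aiofiles.open(path, "w") as f:
        await f.write(data)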

@@ -298,17 +299,17 @@ def parent(a: int) -> int:
     # Now run the chain and check the resulting posts
     cb = [tracer]
     if method == "invoke":
-        res: Any = parent.invoke(1, {"callbacks": cb})  # type: ignore
+        res: Any = await asyncio.to_thread(parent.invoke, 1, {"callbacks": cb})  # type: ignore
eyurtsev (Collaborator) commented:

Is there any benefit to doing this? It feels like it's complicating the test code.

cbornet (Collaborator, Author) commented Jan 9, 2025:

I refactored this part to cleanly separate sync and async tests: see c2162c2
With this the asyncio.to_thread calls are not needed.
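For illustration, the separation means splitting the combined parametrized test into a plain sync test and an async test, roughly like this (a simplified sketch reusing the parent runnable and cb callbacks from the hunk above; not the actual commit):

def test_parent_invoke() -> None:
    # Sync test: no event loop is running, so blocking calls are fine here.
    res = parent.invoke(1, {"callbacks": cb})

async def test_parent_ainvoke() -> None:
    # Async test: only the async API is used, so Blockbuster has nothing to flag.
    res = await parent.ainvoke(1, {"callbacks": cb})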

cbornet force-pushed the blockbuster branch 5 times, most recently from 188abdf to 4598e0d (January 10, 2025 17:27)
eyurtsev (Collaborator) left a comment:
OK makes sense -- I'll finish reviewing the test changes on Monday and will merge.


If there were a way to configure Blockbuster to fail based on the stack trace (e.g., we mostly care about async calls to LangChain APIs), it would probably make it more valuable (or at least less effort to adopt widely):

test_foo.py

async def test1():
    with open(...) as f:  # blocking call, but we don't care 99% of the time
        ...

    # Blocking call from calling a sync method in an async test: we probably
    # care about it, but there might be some exceptions.
    runnable.invoke()

    # Blocking call from an async API: we care about this 100% of the time.
    await some_langchain_thing()  # results in a blocking call

cbornet force-pushed the blockbuster branch 2 times, most recently from 3a082ce to e5c91e0 (January 13, 2025 13:12)
cbornet force-pushed the blockbuster branch 3 times, most recently from 48dd730 to 3c8ab33 (January 13, 2025 15:11)
cbornet (Collaborator, Author) commented Jan 13, 2025:

> If there were a way to configure Blockbuster to fail based on the stack trace (e.g., we mostly care about async calls to LangChain APIs), it would probably make it more valuable (or at least less effort to adopt widely)

I can have a look at that, but it means inspecting the full stack to search for langchain_core files. That is a slow operation and I don't know whether it will be performant enough.
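For illustration, such a stack-based filter would look roughly like this (a minimal sketch using the standard inspect module; Blockbuster does not necessarily expose this as an option today):

import inspect

def _blocking_call_involves_langchain_core() -> bool:
    # Walk the current call stack and check whether any frame comes from
    # langchain_core. Doing this on every detected blocking call is what
    # makes the approach potentially slow.
    return any("langchain_core" in frame.filename for frame in inspect.stack())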

(
bb.functions[func]
.can_block_in("langchain_core/_api/internal.py", "is_caller_internal")
.can_block_in("langchain_core/runnables/base.py", "__repr__")
cbornet (Collaborator, Author) commented:

RunnableLambda's __repr__ calls get_lambda_source, which is blocking. It should probably be cached.
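For illustration, caching could look roughly like this (a minimal sketch using functools.lru_cache, with inspect.getsource standing in for get_lambda_source; not the actual langchain_core code):

import inspect
from functools import lru_cache

@lru_cache(maxsize=None)
def _cached_lambda_source(func):
    # The source is read from disk (blocking) only on the first call for a
    # given function; subsequent __repr__ calls reuse the cached value.
    return inspect.getsource(func)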
