langgraph: actually run test_large_cases_async #1307
Triggered via pull request
December 19, 2024 22:17
Status
Success
Total duration
45m 34s
Artifacts
–
Annotations
1 warning and 2 notices
benchmark
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|
Benchmark results:
libs/langgraph/tests/__snapshots__/test_large_cases_async.ambr#L1
.........................................
fanout_to_subgraph_10x: Mean +- std dev: 60.3 ms +- 1.2 ms
.........................................
fanout_to_subgraph_10x_sync: Mean +- std dev: 51.9 ms +- 0.8 ms
.........................................
fanout_to_subgraph_10x_checkpoint: Mean +- std dev: 73.1 ms +- 1.0 ms
.........................................
fanout_to_subgraph_10x_checkpoint_sync: Mean +- std dev: 93.6 ms +- 0.9 ms
.........................................
fanout_to_subgraph_100x: Mean +- std dev: 597 ms +- 22 ms
.........................................
fanout_to_subgraph_100x_sync: Mean +- std dev: 507 ms +- 6 ms
.........................................
fanout_to_subgraph_100x_checkpoint: Mean +- std dev: 736 ms +- 12 ms
.........................................
fanout_to_subgraph_100x_checkpoint_sync: Mean +- std dev: 936 ms +- 17 ms
.........................................
react_agent_10x: Mean +- std dev: 30.4 ms +- 0.6 ms
.........................................
react_agent_10x_sync: Mean +- std dev: 22.6 ms +- 0.4 ms
.........................................
react_agent_10x_checkpoint: Mean +- std dev: 37.3 ms +- 0.7 ms
.........................................
react_agent_10x_checkpoint_sync: Mean +- std dev: 36.4 ms +- 0.4 ms
.........................................
react_agent_100x: Mean +- std dev: 336 ms +- 6 ms
.........................................
react_agent_100x_sync: Mean +- std dev: 272 ms +- 2 ms
.........................................
react_agent_100x_checkpoint: Mean +- std dev: 829 ms +- 5 ms
.........................................
react_agent_100x_checkpoint_sync: Mean +- std dev: 831 ms +- 12 ms
.........................................
wide_state_25x300: Mean +- std dev: 22.8 ms +- 0.5 ms
.........................................
wide_state_25x300_sync: Mean +- std dev: 14.9 ms +- 0.6 ms
.........................................
wide_state_25x300_checkpoint: Mean +- std dev: 273 ms +- 13 ms
.........................................
wide_state_25x300_checkpoint_sync: Mean +- std dev: 272 ms +- 12 ms
.........................................
wide_state_15x600: Mean +- std dev: 26.6 ms +- 0.6 ms
.........................................
wide_state_15x600_sync: Mean +- std dev: 17.1 ms +- 0.2 ms
.........................................
wide_state_15x600_checkpoint: Mean +- std dev: 469 ms +- 13 ms
.........................................
wide_state_15x600_checkpoint_sync: Mean +- std dev: 470 ms +- 14 ms
.........................................
wide_state_9x1200: Mean +- std dev: 26.6 ms +- 0.5 ms
.........................................
wide_state_9x1200_sync: Mean +- std dev: 17.1 ms +- 0.1 ms
.........................................
wide_state_9x1200_checkpoint: Mean +- std dev: 308 ms +- 16 ms
.........................................
wide_state_9x1200_checkpoint_sync: Mean +- std dev: 321 ms +- 21 ms
|
Comparison against main:
libs/langgraph/tests/__snapshots__/test_large_cases_async.ambr#L1
+-----------------------------------------+---------+-----------------------+
| Benchmark | main | changes |
+=========================================+=========+=======================+
| fanout_to_subgraph_100x_checkpoint | 758 ms | 736 ms: 1.03x faster |
+-----------------------------------------+---------+-----------------------+
| fanout_to_subgraph_100x | 608 ms | 597 ms: 1.02x faster |
+-----------------------------------------+---------+-----------------------+
| fanout_to_subgraph_10x_checkpoint | 74.4 ms | 73.1 ms: 1.02x faster |
+-----------------------------------------+---------+-----------------------+
| fanout_to_subgraph_10x_checkpoint_sync | 94.8 ms | 93.6 ms: 1.01x faster |
+-----------------------------------------+---------+-----------------------+
| fanout_to_subgraph_100x_sync | 512 ms | 507 ms: 1.01x faster |
+-----------------------------------------+---------+-----------------------+
| fanout_to_subgraph_100x_checkpoint_sync | 943 ms | 936 ms: 1.01x faster |
+-----------------------------------------+---------+-----------------------+
| react_agent_100x_checkpoint | 835 ms | 829 ms: 1.01x faster |
+-----------------------------------------+---------+-----------------------+
| react_agent_10x_checkpoint_sync | 36.7 ms | 36.4 ms: 1.01x faster |
+-----------------------------------------+---------+-----------------------+
| react_agent_100x | 338 ms | 336 ms: 1.01x faster |
+-----------------------------------------+---------+-----------------------+
| react_agent_100x_sync | 273 ms | 272 ms: 1.00x faster |
+-----------------------------------------+---------+-----------------------+
| wide_state_9x1200 | 26.5 ms | 26.6 ms: 1.01x slower |
+-----------------------------------------+---------+-----------------------+
| wide_state_25x300_sync | 14.7 ms | 14.9 ms: 1.01x slower |
+-----------------------------------------+---------+-----------------------+
| wide_state_9x1200_checkpoint_sync | 305 ms | 321 ms: 1.05x slower |
+-----------------------------------------+---------+-----------------------+
| Geometric mean | (ref) | 1.00x faster |
+-----------------------------------------+---------+-----------------------+
Benchmark hidden because not significant (15): wide_state_15x600_checkpoint, wide_state_15x600_checkpoint_sync, fanout_to_subgraph_10x, react_agent_10x_checkpoint, react_agent_10x, wide_state_25x300_checkpoint_sync, wide_state_15x600_sync, wide_state_9x1200_sync, wide_state_25x300_checkpoint, wide_state_25x300, react_agent_100x_checkpoint_sync, fanout_to_subgraph_10x_sync, wide_state_9x1200_checkpoint, react_agent_10x_sync, wide_state_15x600
|