
Adjustments to Token Info Fetcher #1931

Merged: 5 commits merged into main on Oct 9, 2023

Conversation

@nlordell (Contributor) commented Oct 9, 2023

Description

In debugging with @fleupold on Friday, we noticed that the token info fetcher was potentially locking up concurrent quotes for certain price estimators, notably the ParaSwap and Quasimodo estimators, which make use of token information (specifically, decimals). This is caused by two things:

  1. The token info cache holds an async Mutex for the entire duration of fetching token information.
  2. The autopilot queries token info for 0xeee...eee, which always fails and therefore never gets included in the cache, so every such query holds the lock for a full node round trip.

The combination of these two issues prevents quotes that need token information from executing fully concurrently. In particular, if quote 1 needs token info for A and quote 2 needs it for B, then quote 2 has to wait for the fetch of A to finish before it starts fetching B (even though the two fetches could run in parallel).
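
For illustration, a minimal sketch of the problematic pattern (hypothetical, simplified method body; the real fetcher operates on batches of addresses):

    // The async Mutex guards both the cache *and* the in-flight fetch, so a
    // cache miss for token A blocks an unrelated miss for token B until A's
    // node round trip completes.
    async fn token_info(&self, address: H160) -> Result<TokenInfo, Error> {
        let mut cache = self.cache.lock().await; // async Mutex acquired here...
        if let Some(info) = cache.get(&address) {
            return Ok(info.clone());
        }
        // ...and still held across this await; errors are never cached.
        let info = self.inner.token_info(address).await?;
        cache.insert(address, info.clone());
        Ok(info)
    }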

Changes

  • Implement an exception for the 0xeee...eee address.
  • Only skip caching on node failures, not on contract failures (i.e. if a contract doesn't have decimals, we now cache that fact instead of querying it every time).
  • Allow concurrent access to the cache using futures::Shared (see the sketch below). I didn't opt for using RequestSharing as it didn't quite fit and the semantics are slightly different.
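
A minimal sketch of the new caching pattern, assuming a simplified single-address API (illustrative names and types, not the exact ones from the PR):

    use futures::{future::{BoxFuture, Shared}, FutureExt};
    use std::{collections::HashMap, future::Future, sync::Mutex};

    type H160 = [u8; 20]; // stand-in for the real address type

    #[derive(Clone)]
    struct TokenInfo {
        symbol: Option<String>,
        decimals: Option<u8>,
    }

    type SharedTokenInfo = Shared<BoxFuture<'static, Result<TokenInfo, String>>>;

    // The synchronous Mutex is held only long enough to clone or insert a
    // Shared handle; the node round trip happens after the lock is released.
    // Different tokens fetch concurrently, and concurrent requests for the
    // same token share a single in-flight future.
    fn shared_fetch(
        cache: &Mutex<HashMap<H160, SharedTokenInfo>>,
        address: H160,
        fetch: impl Future<Output = Result<TokenInfo, String>> + Send + 'static,
    ) -> SharedTokenInfo {
        cache
            .lock()
            .unwrap()
            .entry(address)
            .or_insert_with(|| fetch.boxed().shared())
            .clone()
    }

Callers then await the returned SharedTokenInfo outside the lock.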

How to test

Added a new test to verify the logic.

Related Issues

@nlordell requested a review from a team as a code owner October 9, 2023 07:47
@fleupold (Contributor) left a comment:

LGTM

I didn't opt for using RequestSharing as it didn't quite fit and the semantics are slightly different.

What in particular is different about the semantics? Is it that request sharing itself doesn't keep the result around as soon as the future is completed but here we want to store the result indefinitely?

crates/shared/src/token_info.rs
let info = fetch.await;
if info.is_err() {
    let mut cache = self.cache.lock().unwrap();
    // Only evict our own errored future; a fresh one peeks as unresolved.
    if let Some(Err(_)) = cache.get(&address).and_then(|fetch| fetch.peek()) {
        cache.remove(&address);
    }
}
Contributor:
What is this condition checking? Can't we just unconditionally remove the future in case of an error?

@nlordell (author):

It checks that the shared future stored in the map has resolved to an error (via Shared::peek).

Imagine 3 concurrent requests Rn where:

  1. R1 starts a shared fetch of some token information
  2. R2 uses the same shared fetch
  3. The shared future resolves to an error, R1 removes the shared future from the cache
  4. R3 starts a new shared fetch of token information for the same address, the cache entry was removed in 3, so it creates a new one
  5. R2 gets scheduled and removes the new shared future that was created by R3.

This isn't extremely contrived: there is an await point before clearing the cache, so if R2 doesn't get polled for some time, this would happen. That said, I don't think it will happen regularly, and at worst we will be slightly sub-optimal and duplicate some queries.

Another approach would be to peek at the shared future when we take the first cache lock and replace it in case it has resolved to an error. The downside being that we could hold some useless Err(String)s in the cache that won't ever get used.
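
A sketch of that alternative, for comparison (illustrative; create_fetch stands in for whatever constructs the shared fetch):

    // On lookup, reuse the cached future unless it already resolved to an
    // error, in which case replace it in place. No post-fetch eviction is
    // needed, but errored entries linger in the cache until re-queried.
    let fetch = {
        let mut cache = self.cache.lock().unwrap();
        match cache.get(&address) {
            Some(existing) if !matches!(existing.peek(), Some(Err(_))) => existing.clone(),
            _ => {
                let fetch = create_fetch(address); // hypothetical helper
                cache.insert(address, fetch.clone());
                fetch
            }
        }
    };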

@nlordell (author) commented Oct 9, 2023

Is it that request sharing itself doesn't keep the result around as soon as the future is completed but here we want to store the result indefinitely?

Correct; we would use RequestSharing to share the updating of some cache behind an Arc<Mutex<_>>, instead of just keeping a shared future behind an Arc<Mutex<_>>. Overall, I didn't find that it made the code clearer, but I'm happy to propose an alternative so others can decide.

-    inner: Box<dyn TokenInfoFetching>,
-    cache: Arc<Mutex<HashMap<H160, TokenInfo>>>,
+    inner: Arc<dyn TokenInfoFetching>,
+    cache: Arc<Mutex<HashMap<H160, SharedTokenInfo>>>,
Contributor:

We might have to keep an eye on memory consumption here.
Storing a whole future instead of a String and a u8 for A LOT of tokens might be a problem. Maybe it makes sense to have something like:

enum CacheEntry {
    Resolved(TokenInfo),
    InFlight(SharedTokenInfoRequest)
}

No need to change in this PR, though.

@nlordell (author) commented Oct 9, 2023:

Storing a whole future instead of a String and a u8 for A LOT of tokens might be a problem

It's a boxed future though, so it should just be a ptr + vtable, which is as big as a TokenInfo.

@nlordell (author):

Wow, to my surprise, SharedTokenInfo is smaller than a TokenInfo (without counting the data that exists on the heap): https://play.rust-lang.org/?version=stable&mode=debug&edition=2021&gist=4d4fb54c802e8db652c6608eb9c83d35.

This is likely because of a combination of boxing and null-pointer optimization.

With that in mind, I imagine that this type is fairly space-optimized.
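
The same comparison can be reproduced locally (a sketch; TokenInfo's field layout is approximated and the byte counts assume x86_64):

    use futures::future::{BoxFuture, Shared};
    use std::mem::size_of;

    #[allow(dead_code)]
    struct TokenInfo {
        symbol: Option<String>,
        decimals: Option<u8>,
    }

    type SharedTokenInfo = Shared<BoxFuture<'static, Result<TokenInfo, String>>>;

    fn main() {
        // Shared is an Option<Arc<...>> plus a usize waker key (16 bytes),
        // while TokenInfo needs 24 bytes for Option<String> (null-pointer
        // optimized) plus Option<u8> and padding (32 bytes).
        println!("TokenInfo:       {} bytes", size_of::<TokenInfo>());
        println!("SharedTokenInfo: {} bytes", size_of::<SharedTokenInfo>());
    }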

Contributor:

It's a boxed future though, so it should just be a ptr + vtable, which is as big as a TokenInfo.

Yeah. The handle to the future is very small, but you'll still keep the actual allocation for the future around, which is what I was mainly worried about.

@nlordell (author):

It shouldn't, though, right?

Internally, Shared has an UnsafeCell containing:

enum FutureOrOutput<Fut: Future> {
    Future(Fut),
    Output(Fut::Output),
}

Here, Fut = BoxFuture, so it's 16 bytes on x86_64 (ptr + vtable). AFAIU, once the unsafe cell gets set to FutureOrOutput::Output, the future will get dropped, right?

Contributor:

Ahh, true. I assumed Shared would basically keep the inner future alive the whole time and simply special-case returning the resolved value, instead of swapping the future allocation for the resolved result. 👌

@nlordell (author):

For example here:

use futures::FutureExt;
use std::{
    future::Future,
    pin::Pin,
    task::{Context, Poll},
};

struct Foo;

impl Future for Foo {
    type Output = i32;
    
    fn poll(self: Pin<&mut Self>, _: &mut Context) -> Poll<Self::Output> {
        Poll::Ready(42)
    }
}

impl Drop for Foo {
    fn drop(&mut self) {
        println!("Foo::drop");
    }
}

fn main() {
    let foo = Foo.boxed().shared();

    {
        println!("cloning handles");
        for i in 0..3 {
            let answer = futures::executor::block_on(foo.clone());
            println!("answer {i}: {answer}");
        }
    }
    
    // We now have a handle to `foo` which was resolved, if we await it, we
    // shouldn't drop the future again.
    println!("foo is resolved");
    let _ = futures::executor::block_on(foo);
}

Output:

cloning handles
Foo::drop
answer 0: 42
answer 1: 42
answer 2: 42
foo is resolved

The future gets dropped the first time it resolves, and only dropped once.

The only issue would be if there were "dangling" futures in the HashMap. This could be fixed by removing dangling Shared futures that aren't resolved. I don't expect this to happen in practice (especially since follow-up token info requests will drive the Shared future forward).
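
Such a cleanup could look roughly like this (a sketch; it relies on Shared::strong_count, which returns None once the future has been polled to completion):

    // Keep entries that have resolved, or that still have at least one
    // handle besides the cache's own driving them forward.
    cache.lock().unwrap().retain(|_, fut| {
        fut.peek().is_some() || fut.strong_count().map_or(false, |n| n > 1)
    });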

@nlordell enabled auto-merge (squash) October 9, 2023 13:08
@nlordell merged commit e1da131 into main Oct 9, 2023
@nlordell deleted the token-info-fetching-fixes branch October 9, 2023 13:32
@github-actions bot locked and limited conversation to collaborators Oct 9, 2023
Development

Successfully merging this pull request may close these issues.

CachedTokenInfoFetcher hogs cache Mutex while requests are in flight