feat(bloom): store running aggregate in memory #2367

sistemd · 2024-11-12T14:23:19Z

Closes #2354.
Closes #2355.

kkovaacs

Looks great, thanks!

BTW, do you have some rough estimates on how long reconstructing the running aggregate will take when starting up pathfinder?

sistemd · 2024-11-13T22:40:03Z

Looks great, thanks!

BTW, do you have some rough estimates on how long reconstructing the running aggregate will take when starting up pathfinder?

Yeah, it seems to slow down our startup time by a few seconds on average. If that's too much, a good middle ground might be to store the running aggregate filter, but do it less often than the filter's entire range. In other words: aggregate filters cover 32k blocks each, and we can store the running aggregate filter for example after every 8k or 4k blocks. This is a middle ground between the approach implemented by this PR and the approach which was in place previously (and actually thinking about it, this might be really easy to implement).

I'll let you know exactly how long the reconstruction takes in the worst possible case (when 32k - 1 blocks need to be reconstructed).

kkovaacs · 2024-11-14T07:49:30Z

Yeah, it seems to slow down our startup time by a few seconds on average. If that's too much, a good middle ground might be to store the running aggregate filter, but do it less often than the filter's entire range. In other words: aggregate filters cover 32k blocks each, and we can store the running aggregate filter for example after every 8k or 4k blocks. This is a middle ground between the approach implemented by this PR and the approach which was in place previously (and actually thinking about it, this might be really easy to implement).

I'll let you know exactly how long the reconstruction takes in the worst possible case (when 32k - 1 blocks need to be reconstructed).

I think a few seconds should be totally fine.

CHr15F0x · 2024-11-14T09:21:17Z

crates/storage/src/connection/event.rs

+            let running_aggregate = match self.running_aggregate.lock() {
+                Ok(guard) => guard,
+                Err(poisoned) => {
+                    tracing::error!("Poisoned lock in load_aggregate_bloom_range");
+                    poisoned.into_inner()
+                }
+            };


We don't have a uniform way of locking mutexes in the code and it's high time we had one. 🤔

sistemd requested a review from a team as a code owner November 12, 2024 14:23

kkovaacs approved these changes Nov 13, 2024

View reviewed changes

CHr15F0x reviewed Nov 14, 2024

View reviewed changes

CHr15F0x approved these changes Nov 14, 2024

View reviewed changes

running aggregate stored in memory

6f74d53

sistemd force-pushed the sistemd/running-aggregate-in-memory branch from 1a3205d to 6f74d53 Compare November 14, 2024 16:24

sistemd merged commit 0f6654a into main Nov 14, 2024
7 checks passed

sistemd deleted the sistemd/running-aggregate-in-memory branch November 14, 2024 16:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(bloom): store running aggregate in memory #2367

feat(bloom): store running aggregate in memory #2367

sistemd commented Nov 12, 2024 •

edited

Loading

kkovaacs left a comment

sistemd commented Nov 13, 2024 •

edited

Loading

kkovaacs commented Nov 14, 2024

CHr15F0x Nov 14, 2024

feat(bloom): store running aggregate in memory #2367

feat(bloom): store running aggregate in memory #2367

Conversation

sistemd commented Nov 12, 2024 • edited Loading

kkovaacs left a comment

Choose a reason for hiding this comment

sistemd commented Nov 13, 2024 • edited Loading

kkovaacs commented Nov 14, 2024

CHr15F0x Nov 14, 2024

Choose a reason for hiding this comment

sistemd commented Nov 12, 2024 •

edited

Loading

sistemd commented Nov 13, 2024 •

edited

Loading