Optimize Index cache to avoid inverted Index file loading on cache miss #5011

WenyXu · 2024-11-18T06:41:41Z

What type of enhancement is this?

Performance

What does the enhancement do?

In our current system, an index cache miss triggers loading the entire index file from S3, regardless of how much data the query actually needs. This is very inefficient when only a small portion of the index file is required.

In some cases, only a small fraction of the index data is accessed. Despite this, the system downloads the entire 200 MiB index file. The download time dominates the query's total execution time, leading to degraded performance.

Implementation challenges

No response

CookiePieWw · 2024-12-09T03:42:33Z

Possible solution:

Currently the caches in mito can be divided into 2 kinds:

1. Caches for relatively small structs like metadata
   - SstMetaCache, VectorCache, metadata in InvertedIndexCache
2. Caches for large content
   - PageCache, contents in InvertedIndexCache

For the first, we could keep the current cache management strategy, which means the caller is responsible for fetching values and putting them into the cache. We can provide a register method and a getter/setter pair for extensibility:

cache_manager.register_structured(TYPE, ...);
cache_manager.put(TYPE, key, value);
cache_manager.get(TYPE, key);
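
To make the register/put/get shape above concrete, here is a minimal sketch. It is not mito's actual API: the string-typed cache names, the `Any`-based value storage, and the `bool` return of `put` are all assumptions made for illustration only.

```rust
use std::any::Any;
use std::collections::HashMap;
use std::sync::Arc;

/// Hypothetical manager holding one key/value map per registered cache type.
#[derive(Default)]
struct CacheManager {
    // cache type name -> (key -> type-erased value)
    structured: HashMap<String, HashMap<String, Arc<dyn Any + Send + Sync>>>,
}

impl CacheManager {
    /// Registers a structured cache under `ty`; registering twice is a no-op.
    fn register_structured(&mut self, ty: &str) {
        self.structured.entry(ty.to_string()).or_default();
    }

    /// Stores `value` under `key`. Returns false if `ty` was never registered.
    fn put<V: Any + Send + Sync>(&mut self, ty: &str, key: &str, value: V) -> bool {
        match self.structured.get_mut(ty) {
            Some(cache) => {
                cache.insert(key.to_string(), Arc::new(value));
                true
            }
            None => false,
        }
    }

    /// Returns the cached value if it exists and has the requested type `V`.
    fn get<V: Any + Send + Sync>(&self, ty: &str, key: &str) -> Option<Arc<V>> {
        let value = self.structured.get(ty)?.get(key)?.clone();
        value.downcast::<V>().ok()
    }
}
```

The caller still decides what to fetch and when to populate the cache; the manager only stores and returns values. Eviction and size accounting are left out of the sketch.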

For the second, we could provide a page-based cache strategy. Here the caller simply gives an offset and a size, and the cache manager is responsible for fetching values and putting them into the cache, similar to #5114:

cache_manager.register_paged(TYPE, page_size, reader, ...);
cache_manager.read(TYPE, offset, size, ...);

The page-based cache fetches and caches remote files in fixed-size pages, and returns Vec<u8> as results.
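
A rough sketch of that read path, to show how an (offset, size) request maps onto fixed-size pages. Everything here is hypothetical: the `PageReader` methods, the in-memory `MemReader` stand-in for a remote file, and the unbounded `HashMap` used as the page store are assumptions, not the code from #5114.

```rust
use std::collections::HashMap;

/// Hypothetical reader: fetches up to `len` bytes starting at `offset`.
trait PageReader {
    fn read_range(&mut self, offset: u64, len: u64) -> Vec<u8>;
    fn file_size(&self) -> u64;
}

/// In-memory stand-in for a remote file; counts how many fetches happen.
struct MemReader {
    data: Vec<u8>,
    fetches: usize,
}

impl PageReader for MemReader {
    fn read_range(&mut self, offset: u64, len: u64) -> Vec<u8> {
        self.fetches += 1;
        let start = (offset as usize).min(self.data.len());
        let end = (start + len as usize).min(self.data.len());
        self.data[start..end].to_vec()
    }

    fn file_size(&self) -> u64 {
        self.data.len() as u64
    }
}

struct PagedCache<R: PageReader> {
    page_size: u64,
    reader: R,
    pages: HashMap<u64, Vec<u8>>, // page index -> page bytes (no eviction here)
}

impl<R: PageReader> PagedCache<R> {
    fn new(page_size: u64, reader: R) -> Self {
        assert!(page_size > 0, "page size must be non-zero");
        Self { page_size, reader, pages: HashMap::new() }
    }

    /// Reads `size` bytes at `offset`, fetching only the pages not yet cached.
    /// The request is clamped to the end of the file.
    fn read(&mut self, offset: u64, size: u64) -> Vec<u8> {
        let end = offset.saturating_add(size).min(self.reader.file_size());
        let mut out = Vec::new();
        if offset >= end {
            return out;
        }
        let first = offset / self.page_size;
        let last = (end - 1) / self.page_size;
        for idx in first..=last {
            let page_start = idx * self.page_size;
            if !self.pages.contains_key(&idx) {
                let bytes = self.reader.read_range(page_start, self.page_size);
                self.pages.insert(idx, bytes);
            }
            let page = &self.pages[&idx];
            // Slice out the part of this page that overlaps the request.
            let lo = offset.max(page_start) - page_start;
            let hi = end.min(page_start + self.page_size) - page_start;
            out.extend_from_slice(&page[lo as usize..hi as usize]);
        }
        out
    }
}
```

With a 16-byte page size, a read of 10 bytes at offset 10 touches pages 0 and 1 only, and a later read inside page 0 causes no further fetch. A real implementation would also need eviction and concurrent access, which this sketch ignores.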

Possible impl:

pub struct StructuredCache<K, V>;

pub struct PagedCache<K>;

pub trait PageReader;

pub trait PageKey {
    fn offset_to_keys(size, offset);
}

pub enum MitoCache;

pub struct CacheManager {
    cache: HashMap<String, MitoCache>,
}

killme2008 · 2025-01-04T09:53:18Z

@WenyXu @CookiePieWw What's the progress of this issue?

CookiePieWw · 2025-01-04T10:10:17Z

The cache granularity of the inverted index has been changed to fixed-size pages in #5114, and several optimizations have been introduced in #5145, #5146, #5147 and #5148.

I think we can close the issue for now? cc @WenyXu

WenyXu added the C-enhancement Category Enhancements label Nov 18, 2024

CookiePieWw mentioned this issue Dec 7, 2024

refactor: cache inverted index with fixed-size page #5114

Merged

3 tasks

WenyXu added this to the v0.12 milestone Dec 10, 2024

WenyXu assigned CookiePieWw Dec 10, 2024

WenyXu changed the title ~~Optimize Index cache to avoid full Index file loading on cache miss~~ Optimize Index cache to avoid inverted Index file loading on cache miss Dec 10, 2024
