
First version of turbo cache #2

Open

wants to merge 165 commits into main

Conversation


@andectionsharechat commented Oct 18, 2023

Context

This is a fast cache fork. It reduces lock contention by shortening the time spent under the mutex during writes.

Fast cache implementation

The fast cache implementation is fairly simple. The key idea is that all data is stored in an array of fixed-size chunks. Items are written sequentially using the layout {encoded key & value lengths, key, value}.
There is an index (just a map) that resolves a key hash to the item's position in the chunks. All operations are executed under a read-write mutex. Keys are spread across 512 buckets, selected by hash(key) % bucket count.

[diagram: fast cache chunk and index layout]
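
To make the layout concrete, here is a minimal, self-contained sketch of that write/read path. It is an illustration under assumptions, not the actual fastcache code: a flat byte slice stands in for the array of 64KB chunks, and FNV stands in for the real hash function.

```go
package main

import (
	"fmt"
	"hash/fnv"
	"sync"
)

const bucketCount = 512

// bucket sketches one fast cache bucket: items are appended sequentially
// as {key len, value len, key, value}, and a map resolves the key hash to
// the item's offset.
type bucket struct {
	mu    sync.RWMutex
	data  []byte         // stands in for the array of 64KB chunks
	index map[uint64]int // key hash -> offset of the item in data
}

func (b *bucket) set(h uint64, k, v []byte) {
	b.mu.Lock()
	defer b.mu.Unlock()
	off := len(b.data)
	// encoded key & value lengths (2 bytes each), then key, then value
	b.data = append(b.data, byte(len(k)>>8), byte(len(k)), byte(len(v)>>8), byte(len(v)))
	b.data = append(b.data, k...)
	b.data = append(b.data, v...)
	b.index[h] = off
}

func (b *bucket) get(h uint64) ([]byte, bool) {
	b.mu.RLock()
	defer b.mu.RUnlock()
	off, ok := b.index[h]
	if !ok {
		return nil, false
	}
	kLen := int(b.data[off])<<8 | int(b.data[off+1])
	vLen := int(b.data[off+2])<<8 | int(b.data[off+3])
	start := off + 4 + kLen
	return b.data[start : start+vLen], true
}

func hashKey(k []byte) uint64 {
	h := fnv.New64a()
	h.Write(k)
	return h.Sum64()
}

func main() {
	buckets := make([]bucket, bucketCount)
	for i := range buckets {
		buckets[i].index = map[uint64]int{}
	}
	k, v := []byte("answer"), []byte("42")
	h := hashKey(k)
	b := &buckets[h%bucketCount] // hash(key) % bucket count
	b.set(h, k, v)
	val, _ := b.get(h)
	fmt.Println(string(val)) // 42
}
```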

Turbo cache: overview

Turbo cache extends fast cache with several changes (a sketch follows the list):

  1. On write, all new items are put into a channel and processed by a background goroutine. The number of goroutines equals the number of buckets.
  2. The goroutine writes new items sequentially to dedicated flush chunks. It needs no mutex, as all changes happen in the same goroutine. Periodically, the turbo cache flushes all items to the main chunks.
  3. Turbo cache also uses a dedicated index for reading items from the flush chunks.
    [diagram: turbo cache write path through the flush chunks]
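
The sketch below illustrates this write path under assumptions: all names are made up for the example, and the flush itself is elided; the real code is linked from the pointers further down.

```go
package turbocache

import "time"

// kv is one pending write travelling through a bucket's channel.
type kv struct {
	hash uint64
	key  []byte
	val  []byte
}

// turboBucket sketches one bucket: a channel of pending writes and a
// background goroutine that owns the flush chunk, so it can append
// without taking any mutex.
type turboBucket struct {
	in         chan kv
	flushChunk []byte
}

func newTurboBucket(flushInterval time.Duration) *turboBucket {
	b := &turboBucket{in: make(chan kv, 128)}
	go func() {
		ticker := time.NewTicker(flushInterval)
		defer ticker.Stop()
		for {
			select {
			case item := <-b.in:
				// single-writer append into the flush chunk: no lock taken
				b.flushChunk = append(b.flushChunk, item.key...)
				b.flushChunk = append(b.flushChunk, item.val...)
			case <-ticker.C:
				b.flushToMainChunks()
			}
		}
	}()
	return b
}

// flushToMainChunks would move accumulated items into the main chunks;
// elided in this sketch.
func (b *turboBucket) flushToMainChunks() {}

// Set routes an item to its bucket's channel and returns immediately.
func Set(buckets []*turboBucket, h uint64, k, v []byte) {
	buckets[h%uint64(len(buckets))].in <- kv{hash: h, key: k, val: v}
}
```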

Turbo cache: configuration

```go
type Config struct {
	// max bytes for storing keys and values in the chunks
	maxBytes int
	// interval in milliseconds for flushing pending keys to the chunks
	flushIntervalMillis int64
	// max batch size for writing to the chunks; a batch size of 1
	// makes turbo cache behave like a synchronous cache
	maxWriteBatch int
	// number of accumulating buffers (chunks) before a flush;
	// every flush chunk is 64KB
	flushChunkCount int
}

func NewConfig(maxBytes int, flushInterval int64, maxWriteBatch int, flushChunks int) *Config {
	.....
}
```
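
For illustration only, a hypothetical call; these values are examples, not defaults taken from this PR:

```go
// Example values, not defaults from this PR.
cfg := NewConfig(
	512*1024*1024, // maxBytes: 512MB budget for the chunks
	100,           // flushIntervalMillis: flush pending items every 100ms
	128,           // maxWriteBatch: 1 here would make writes synchronous
	1,             // flushChunkCount: one 64KB flush chunk per bucket
)
```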

Principles of thread synchronisation during reads from the index

  1. All shared structures are fixed-size arrays or slices
  2. When processing new items, data is only appended
  3. All mutations happen in a single goroutine
  4. Cleaning the flush chunks and index happens under a mutex-free lock

On a new key

  1. Search the index for a duplicate. Code pointer
  2. Write the key/value into the flush chunk. This is an append-only write into an already allocated array, with no memory copying. Pointer
  3. Publish the value to the chunk with a memory barrier. Code pointer
  4. Add the new value to the index. Code pointer. On write, we first update flushChunkIndex and the position inside the array, and only then the hash; on read, everything is checked in the opposite order (see the sketch below).
  5. Issue a memory barrier for the index. Code pointer
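
A minimal sketch of that publication order, assuming Go's sync/atomic as the memory barrier; the type and field names are illustrative, not the PR's actual identifiers:

```go
package turbocache

import "sync/atomic"

// indexEntry sketches one slot of the flush-chunk index. The writer fills
// the payload fields first and publishes the hash last; a reader that
// observes the hash is then guaranteed to see the payload fields too.
type indexEntry struct {
	flushChunkIndex uint32        // which flush chunk holds the item
	itemOffset      uint32        // item position inside that chunk
	hash            atomic.Uint64 // published last; 0 means "empty"
}

// publish runs in the bucket's single writer goroutine.
func publish(e *indexEntry, chunkIdx, off uint32, h uint64) {
	e.flushChunkIndex = chunkIdx // 1. plain stores of the payload...
	e.itemOffset = off
	e.hash.Store(h) // 2. ...then the release-store of the hash (the barrier)
}

// lookup checks the fields in the opposite order: hash first.
func lookup(e *indexEntry, h uint64) (chunkIdx, off uint32, ok bool) {
	if e.hash.Load() != h {
		return 0, 0, false
	}
	return e.flushChunkIndex, e.itemOffset, true
}
```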

On batching and cleaning the flush chunks and index

This operation must be synchronised with reads from the index. The key idea is two-step locking, sketched after this list.

  1. Set the flushing flag to prevent new reads from the index. Code pointer
  2. Take the spinlock used for reading. Code pointer
  3. Clean the index and flush chunks. Code pointer
  4. Release the flushing flag.
  5. Release the spinlock.
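
A sketch of the flusher side of this two-step locking, assuming a CAS-based spinlock; all names are illustrative:

```go
package turbocache

import (
	"runtime"
	"sync/atomic"
)

// flushLock sketches the two locks: the flushing flag turns away new
// readers, and the spinlock waits out a reader already inside the index.
type flushLock struct {
	flushing atomic.Bool
	spin     atomic.Bool // simple CAS spinlock shared with readers
}

func (l *flushLock) lockSpin() {
	for !l.spin.CompareAndSwap(false, true) {
		runtime.Gosched()
	}
}

func (l *flushLock) unlockSpin() { l.spin.Store(false) }

// flushAndClean runs in the bucket's writer goroutine, following steps 1-5.
func (l *flushLock) flushAndClean(clean func()) {
	l.flushing.Store(true)  // 1. prevent new reads from the index
	l.lockSpin()            // 2. take the spinlock used for reading
	clean()                 // 3. clean the index and flush chunks
	l.flushing.Store(false) // 4. release the flushing flag
	l.unlockSpin()          // 5. release the spinlock
}
```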

On index reading

Reading from the index happens only on a cache miss (the reader side of the spinlock sketch follows this list).

  1. Check whether the flushing flag is set. Code pointer
  2. Read the index with a memory barrier. Code pointer
  3. Search for the key hash in the index. Code pointer
  4. On success, try to acquire the spinlock. Code pointer
  5. On success, find the item's position in the flush chunk and read the data from it. Code pointer
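
Continuing the flushLock sketch above, the reader side might look like this; find and read stand in for the index search and the chunk read:

```go
// tryRead returns false on any contention; the caller then treats the
// lookup as a miss in the flush chunks.
func (l *flushLock) tryRead(find func() bool, read func()) bool {
	if l.flushing.Load() { // 1. a flush is in progress: skip the index
		return false
	}
	if !find() { // 2-3. read the index (memory barrier) and search the hash
		return false
	}
	if !l.spin.CompareAndSwap(false, true) { // 4. try to take the spinlock
		return false
	}
	read() // 5. locate the item in the flush chunk and read the data
	l.unlockSpin()
	return true
}
```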


Turbo cache: writing implementation

  1. Determine the bucket and write to its channel. Code pointer
  2. How a background goroutine processes new keys. Code pointer
  3. How the chunks are updated periodically. Code pointer

Turbo cache: reading implementation

  1. Reading from the index before acquiring the mutex. Code pointer

Turbo cache: testing

Tests

  1. Tests over the public API, plus tests of key updates without the goroutine-based processing. All tests run with different batch sizes.
    All tests are run with the -race flag (see below).
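
For example, the usual way to run the suite with the race detector enabled:

```
go test -race ./...
```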

Turbo cache: benchmarking

  1. During development, benchmarks compared CPU time spent under the mutex.
  2. A comparison between the synchronous fast cache and the async turbo cache is still missing.

Turbo cache: QA

What's the memory overhead?

  1. It's the overhead of the flush chunks plus the index: 512 (bucket count) * 64KB * flushChunkCount + maxWriteBatch * 84B. For flushChunkCount=1 and maxWriteBatch=128 that is 512 * 64KB + 128 * 84B ≈ 32MB + 10.5KB, i.e. about 33MB.
