-
Notifications
You must be signed in to change notification settings - Fork 6.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
compaction not running or running very slowly when entire db is deleted? #13120
Comments
After the entire DB is deleted, is there any more writes to the DB? RocksDB doesn't have a timer-based trigger to check if there's eligible compactions. Usually it tries to schedule a compaction after a flush/compaction. I suspect that there's nothing to trigger compaction to be scheduled. Can you do a manual compaction after you issue all the deletions? |
there is writes, but my workload is extremely read heavy. The other problem is that the service started using 10 cpus and that is the max number of cpus allocated for the service using cgroups so it started being throttled and made things worse. A few questions:
another data point, the last time this happened, a simple db restart fixed the issue and everything went back to normal |
Do you hold snapshot for a very long time? Restart will clear the snapshots. Tombstone can be in the last level files if there's a snapshot preventing them to be dropped.
If the problem is that files with many tombstones are not being compacted down to the last level, then yes. It won't help if there's snapshot keeping tombstones ailve.
The hope is to compact away the tombstones with manual compaction so that iterators won't use this much CPU. |
my delete workflow is as follows:
Is checkpoint and snapshot the same thing here? Given that, does rocksdb consider the checkpoint to be "held" the entire time? Should I update my clean up service to do something like:
|
Checkpoint is different from snapshot, which can be created by GetSnapshot(). Manual compaction should help. To figure out why ttl compaction was not run, you can dump the SST files rocksdb/include/rocksdb/table_properties.h Line 257 in 0f35db5
|
got it, thank you! |
so for my service, there are times when something happens that causes pretty much the entire db to be deleted using individual delete requests. I'm aware that sending individual delete requests to delete the entire db (instead of a range delete for example) will fill the db with tombstones and pretty much degrade the performance of rocksdb severely as we can see below. What I don't understand is why does it take days for compaction to run/finish compacting all of the tombstones away. As we can see there is not much activity and compaction is not running, I've seen this issue happen in the past and it took a few days for rocksdb to recover/go back to normal after an event like this (deleting pretty much the entire db)
rocksdb config:
OPTIONS-000007.txt
I'm running latest rocksdb version, pretty much default settings, however, I do have db ttl set to a few hours which is more confusing since the docs say:
My understanding is that after the entire db is deleted, once the ttl is up, compaction would run and compact the entire db to get rocksdb back to a normal state.
The text was updated successfully, but these errors were encountered: