
Expose More Performance-Related Configurations #630

Conversation

coltmcnealy-lh (Member)

No description provided.

@coltmcnealy-lh (Member Author) commented Jan 28, 2024

Doing testing with the defaults:

  • Core Stream Threads: 2
  • Timer Stream Threads: 2
  • Core Stream Commit Interval: 5 seconds
  • Timer Stream Commit Interval: 30 seconds
  • Core Memtable Size: 64MB
  • Timer Memtable Size: 32MB
  • Core State Store Cache Size: 32MB
  • Timer State Store Cache Size: 64MB
  • Shared RocksDB Block Cache Size: 64MB
  • Write Buffer Manager Size: 256MB
12:54:22 -> for i in $(seq 1 1000); do lhctl run hundred-tasks; done

It completed in 10 minutes 3 seconds. Yikes. It looked like there were a lot of RocksDB write stalls, most likely caused by the Write Buffer Manager size being too small.
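
For reference, a minimal sketch of the bounded-memory pattern this points at, using Kafka Streams' RocksDBConfigSetter; the class name and the hard-coded sizes are illustrative, not the code in this PR:

import java.util.Map;
import org.apache.kafka.streams.state.RocksDBConfigSetter;
import org.rocksdb.BlockBasedTableConfig;
import org.rocksdb.Cache;
import org.rocksdb.LRUCache;
import org.rocksdb.Options;
import org.rocksdb.WriteBufferManager;

public class BoundedMemoryConfigSetter implements RocksDBConfigSetter {

    // One shared block cache and write-buffer manager for every store on this
    // instance; sizes mirror the defaults listed above (64MB cache, 256MB WBM).
    private static final Cache SHARED_CACHE = new LRUCache(64 * 1024 * 1024L);
    private static final WriteBufferManager WRITE_BUFFER_MANAGER =
            new WriteBufferManager(256 * 1024 * 1024L, SHARED_CACHE);

    @Override
    public void setConfig(final String storeName, final Options options, final Map<String, Object> configs) {
        // All memtables draw from the shared write-buffer budget; when it is
        // exhausted, RocksDB flushes (or stalls writes) instead of allocating more.
        options.setWriteBufferManager(WRITE_BUFFER_MANAGER);

        final BlockBasedTableConfig tableConfig = (BlockBasedTableConfig) options.tableFormatConfig();
        tableConfig.setBlockCache(SHARED_CACHE);
        options.setTableFormatConfig(tableConfig);
    }

    @Override
    public void close(final String storeName, final Options options) {
        // The shared cache and WBM are intentionally not closed per store.
    }
}

Because every store on an instance shares one write-buffer budget, an undersized WBM makes stalls like the ones above more likely as partition counts grow.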

@coltmcnealy-lh (Member Author)

  • Core Stream Threads: 2
  • Timer Stream Threads: 2
  • Core Stream Commit Interval: 5 seconds
  • Timer Stream Commit Interval: 30 seconds
  • Core Memtable Size: 64MB
  • Timer Memtable Size: 32MB
  • Core State Store Cache Size: 32MB
  • Timer State Store Cache Size: 64MB
  • Shared RocksDB Block Cache Size: 64MB
  • Write Buffer Manager Size: 4GB

This also took ~10 minutes. It turns out that this ticket is blocked by #576.

@coltmcnealy-lh marked this pull request as ready for review January 28, 2024 22:14
@coltmcnealy-lh (Member Author):

We might want to open a tech-debt ticket to make this class smaller.

@coltmcnealy-lh (Member Author)

I disabled metrics aggregation (MetricsUpdater#listen()) and then ran the benchmark again. Prior to the changes in this PR, the benchmark took 119 seconds. After the changes, running with larger-memory.config took 113 seconds.

I then disabled the changes related to the WriteBufferManager and Block Cache size (no shared WBM or Block Cache) while keeping the changes to the commit interval and state store cache; the small performance improvement remained (113 seconds). This confirmed my thoughts going into this PR:

  • The changes to the WriteBufferManager and Block Cache are intended to reduce the risk of OOM errors.
  • Using a larger commit interval should result in better performance. This performance change should be more drastic when running in a replicated environment with network latency (I am using local dev right now). See the sketch after this list for where these knobs live.
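
A rough sketch of where these knobs live in plain Kafka Streams properties, assuming Kafka Streams 3.4+ (where statestore.cache.max.bytes replaced cache.max.bytes.buffering); the class and the values are illustrative, not LH's actual config wiring:

import java.util.Properties;
import org.apache.kafka.streams.StreamsConfig;

public class TunedStreamsProps {
    public static Properties build() {
        Properties props = new Properties();
        // Longer commit interval: fewer commit/flush cycles and better throughput,
        // at the cost of more reprocessing after a failure.
        props.put(StreamsConfig.COMMIT_INTERVAL_MS_CONFIG, 30_000);
        // Total on-heap record cache shared by all stream threads on this instance.
        props.put(StreamsConfig.STATESTORE_CACHE_MAX_BYTES_CONFIG, 64 * 1024 * 1024L);
        // Number of processing threads for this Streams instance.
        props.put(StreamsConfig.NUM_STREAM_THREADS_CONFIG, 2);
        return props;
    }
}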

I think this PR is ready to review on its own. We should define follow-up issues to:

  • Enable jemalloc on our docker image
  • Determine recommended settings for the various cache sizes based on a given amount of memory.

My profiling has confirmed that this ticket accomplishes what it needed to do—power users can mostly limit the total on-heap and off-heap memory by tuning configurations, and we also improved performance slightly in the case where more resources are provided to the server.

The only danger of this PR is that the new default configurations could result in poorer performance than the old defaults, because for certain types of memory the old configurations did not specify any limit at all (which was dangerous, as it made OOMs more likely to occur).

@coltmcnealy-lh force-pushed the 478-intelligently-allow-user-to-set-streams-configurations-with-smart-guardrails branch from 07880ec to ffcfc8d on January 29, 2024 19:01
// Need to inject the LHServerConfig, but Kafka Streams requires we pass in a Class with
// a default constructor. So the only way to do that is to have a static singleton which
// we set elsewhere in the code.
public static LHServerConfig serverConfig;
@coltmcnealy-lh (Member Author):

@mjsax this appears to be the only way to "inject" configuration into the RocksDBConfigSetter implementation. Would it be worth a KIP to add a method:

KafkaStreams#setRocksDBConfigurator(RocksDBConfigSetter otterSetter)

The method should throw IllegalStateException if it is called after KafkaStreams#start(), or if the rocksdb.config.setter parameter is set on the properties.

@mjsax:

Don't think it's the only way to inject configs.

There is RocksDBConfigSetter.setConfig(..., configs), which takes the Kafka Streams config map as its third parameter. When you create StreamsConfig you pass in Map<?, ?> props, and all of its entries will be contained in configs.

Thus, you can take LHServerConfig and add all of it to props before creating StreamsConfig, and you should have access to all of it in setConfig(...).
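
A minimal sketch of that pattern; the key "lh.server.config" and the class InjectedConfigSetter are hypothetical names, and LHServerConfig is stood in for by Object to keep the sketch self-contained:

import java.util.Map;
import org.apache.kafka.streams.StreamsConfig;
import org.apache.kafka.streams.state.RocksDBConfigSetter;
import org.rocksdb.Options;

public class InjectedConfigSetter implements RocksDBConfigSetter {

    // Hypothetical key: any extra entry in the props map survives into
    // setConfig()'s `configs` argument untouched.
    public static final String LH_CONFIG_KEY = "lh.server.config";

    // Caller side: stash the server config in props before building StreamsConfig.
    public static StreamsConfig buildStreamsConfig(Object lhServerConfig, Map<String, Object> props) {
        props.put(StreamsConfig.ROCKSDB_CONFIG_SETTER_CLASS_CONFIG, InjectedConfigSetter.class);
        props.put(LH_CONFIG_KEY, lhServerConfig);
        return new StreamsConfig(props);
    }

    @Override
    public void setConfig(final String storeName, final Options options, final Map<String, Object> configs) {
        // The same props map arrives here, so the server config can be pulled back out.
        Object serverConfig = configs.get(LH_CONFIG_KEY);
        // ...apply serverConfig-driven RocksDB tuning to `options`...
    }

    @Override
    public void close(final String storeName, final Options options) {
        // No per-store resources to release in this sketch.
    }
}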

@coltmcnealy-lh (Member Author):

@mjsax Thanks so much! This works great. Much better than submitting a KIP to the Apache Kafka Debate Club 😄

CC @eduwercamacaro let's refactor this tomorrow.

@coltmcnealy-lh merged commit c79af27 into master Jan 31, 2024
12 checks passed
@coltmcnealy-lh deleted the 478-intelligently-allow-user-to-set-streams-configurations-with-smart-guardrails branch January 31, 2024 04:26
Successfully merging this pull request may close these issues.

Intelligently allow user to set Streams configurations with smart guardrails.