[exporter]prometheusremotewrite] Fix data race by introducing pool of batch state #36601

ArthurSens · 2024-11-29T13:07:34Z

Description

This is an alternative for #36524 and #36600

This PR does a couple of things:

Add a test written by @edma2 that shows a data race to the batch state when running multiple consumers.
Add a benchmark for PushMetrics, with options to run with a stable number of metrics or varying metrics.
Fix the data race by introducing a sync.Pool of batch states.

Benchmark results

results comparing main, #36600 and this PR:

arthursens$ benchstat main.txt withoutState.txt syncpool.txt 
goos: darwin
goarch: arm64
pkg: github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter
cpu: Apple M2 Pro
                            │  main.txt   │          withoutState.txt           │         syncpool.txt         │
                            │   sec/op    │    sec/op     vs base               │   sec/op     vs base         │
PushMetricsVaryingMetrics-2   8.066m ± 5%   13.821m ± 9%  +71.36% (p=0.002 n=6)   8.316m ± 6%  ~ (p=0.065 n=6)

                            │   main.txt   │           withoutState.txt            │            syncpool.txt            │
                            │     B/op     │     B/op       vs base                │     B/op      vs base              │
PushMetricsVaryingMetrics-2   5.216Mi ± 0%   34.436Mi ± 0%  +560.17% (p=0.002 n=6)   5.548Mi ± 0%  +6.36% (p=0.002 n=6)

                            │  main.txt   │       withoutState.txt       │         syncpool.txt         │
                            │  allocs/op  │  allocs/op   vs base         │  allocs/op   vs base         │
PushMetricsVaryingMetrics-2   56.02k ± 0%   56.05k ± 0%  ~ (p=0.721 n=6)   56.04k ± 0%  ~ (p=0.665 n=6)

Signed-off-by: Arthur Silva Sens <[email protected]>

Signed-off-by: Arthur Silva Sens <[email protected]> (cherry picked from commit bdeb254)

Signed-off-by: Arthur Silva Sens <[email protected]>

Aneurysm9 · 2024-12-01T01:03:29Z

This is strange to me. I don't see any degradations in memory allocations by removing the state, so I'm confused if the optimization ever made any difference or if I'm doing something wrong in the benchmark.

The original issue talked about batch sizes of 100-200k data points, so maybe 10k isn't enough to illustrate the issue?

Signed-off-by: Arthur Silva Sens <[email protected]> (cherry picked from commit 2dc1870)

ArthurSens · 2024-12-02T14:36:38Z

I've increased the amount of data used in the benchmarks, still no difference.

arthursens$ benchstat main.txt withoutState.txt syncpool.txt 
goos: darwin
goarch: arm64
pkg: github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter
cpu: Apple M2 Pro
                            │  main.txt   │       withoutState.txt       │         syncpool.txt         │
                            │   sec/op    │   sec/op     vs base         │   sec/op     vs base         │
PushMetricsVaryingMetrics-2   676.0m ± 5%   685.0m ± 8%  ~ (p=0.240 n=6)   682.9m ± 6%  ~ (p=0.310 n=6)

                            │   main.txt   │       withoutState.txt        │         syncpool.txt          │
                            │     B/op     │     B/op      vs base         │     B/op      vs base         │
PushMetricsVaryingMetrics-2   431.0Mi ± 0%   431.2Mi ± 0%  ~ (p=0.937 n=6)   430.9Mi ± 0%  ~ (p=0.132 n=6)

                            │  main.txt   │        withoutState.txt        │           syncpool.txt            │
                            │  allocs/op  │  allocs/op   vs base           │  allocs/op   vs base              │
PushMetricsVaryingMetrics-2   4.334M ± 0%   4.334M ± 0%  ~ (p=1.000 n=6) ¹   4.334M ± 0%  +0.00% (p=0.002 n=6)
¹ all samples are equal

Individual benchmark runs

Main:

goos: darwin
goarch: arm64
pkg: github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter
cpu: Apple M2 Pro
BenchmarkPushMetricsVaryingMetrics-2   	     100	 641808841 ns/op	451837730 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 680346175 ns/op	452476772 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 673599777 ns/op	451837744 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 674459682 ns/op	451847461 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 677447471 ns/op	452476337 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 687872721 ns/op	452075896 B/op	 4333723 allocs/op
PASS
ok  	github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter	543.110s

Alternative 1 (#36600):

goos: darwin
goarch: arm64
pkg: github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter
cpu: Apple M2 Pro
BenchmarkPushMetricsVaryingMetrics-2   	     100	 632321270 ns/op	452227454 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 688034830 ns/op	451837198 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 685106043 ns/op	452075646 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 680661115 ns/op	451837636 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 684886080 ns/op	452237544 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 686308490 ns/op	452876716 B/op	 4333723 allocs/op
PASS
ok  	github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter	544.852s

This branch:

goos: darwin
goarch: arm64
pkg: github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter
cpu: Apple M2 Pro
BenchmarkPushMetricsVaryingMetrics-2   	     100	 641808841 ns/op	451837730 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 680346175 ns/op	452476772 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 673599777 ns/op	451837744 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 674459682 ns/op	451847461 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 677447471 ns/op	452476337 B/op	 4333723 allocs/op
BenchmarkPushMetricsVaryingMetrics-2   	     100	 687872721 ns/op	452075896 B/op	 4333723 allocs/op
PASS
ok  	github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter	543.110s

dashpole · 2024-12-02T18:46:13Z

Seems like a benchmarking problem. Lets try to work with the original author in the other PR before we choose a path forward.

ArthurSens · 2024-12-05T12:49:35Z

Seems like a benchmarking problem.

That's for sure! I'm struggling to understand why the results look that way :/

Signed-off-by: Arthur Silva Sens <[email protected]> (cherry picked from commit b5a9c1d)

ArthurSens · 2024-12-13T19:15:32Z

Okay, I found the problem. I'm creating a large request by creating an OTel metric with millions of Metrics with similar names and attributes within the same ResourceMetrics. This ends up in a very small tsMap here. Even in the test case with 1 million datapoints we only get three entries in tsMap.

If we go a bit further and look at the code of batchTimeSeries, even if one single entry of tsMap holds 1 million datapoints (around 18MB) and the max batchSize is super small. Those 1 million datapoints become one single PRW request:

opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter/helper.go

Lines 47 to 54 in d72bbd2

    
           if sizeOfCurrentBatch+sizeOfSeries >= maxBatchByteSize { 
        
           	state.nextTimeSeriesBufferSize = max(10, 2*len(tsArray)) 
        
           	wrapped := convertTimeseriesToRequest(tsArray) 
        
           	requests = append(requests, wrapped) 
        
           	tsArray = make([]prompb.TimeSeries, 0, min(state.nextTimeSeriesBufferSize, len(tsMap)-i)) 
        
           	sizeOfCurrentBatch = 0 
        
           }

That happens because converTimeseriesToRequest is not aware of maxBatchSize. I prefer that we fix this bug in a follow-up PR.

To progress the benchmark here, even with the bug, we can diversify the attributes in the original OTel metrics during the benchmark preparation, leading to millions of entries in tsMap. I can even decrease the amount of series again and it still works :)

ArthurSens · 2024-12-13T19:28:03Z

Alright, new results comparing main, #36600 and this PR:

arthursens$ benchstat main.txt withoutState.txt syncpool.txt 
goos: darwin
goarch: arm64
pkg: github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusremotewriteexporter
cpu: Apple M2 Pro
                            │  main.txt   │          withoutState.txt           │         syncpool.txt         │
                            │   sec/op    │    sec/op     vs base               │   sec/op     vs base         │
PushMetricsVaryingMetrics-2   8.066m ± 5%   13.821m ± 9%  +71.36% (p=0.002 n=6)   8.316m ± 6%  ~ (p=0.065 n=6)

                            │   main.txt   │           withoutState.txt            │            syncpool.txt            │
                            │     B/op     │     B/op       vs base                │     B/op      vs base              │
PushMetricsVaryingMetrics-2   5.216Mi ± 0%   34.436Mi ± 0%  +560.17% (p=0.002 n=6)   5.548Mi ± 0%  +6.36% (p=0.002 n=6)

                            │  main.txt   │       withoutState.txt       │         syncpool.txt         │
                            │  allocs/op  │  allocs/op   vs base         │  allocs/op   vs base         │
PushMetricsVaryingMetrics-2   56.02k ± 0%   56.05k ± 0%  ~ (p=0.721 n=6)   56.04k ± 0%  ~ (p=0.665 n=6)

It shows that introducing a pool of states is the better option here, removing the state would introduce a big performance degradation.

Signed-off-by: Arthur Silva Sens <[email protected]>

ArthurSens · 2024-12-16T12:43:12Z

I believe the "Check codeowners" CI is not related to the changes here, right?

ArthurSens added 3 commits November 25, 2024 14:45

Add test to show data race in batchTimeSeries()

2052bd1

Signed-off-by: Arthur Silva Sens <[email protected]>

Add benchmark for PushMetrics()

f54684c

Signed-off-by: Arthur Silva Sens <[email protected]> (cherry picked from commit bdeb254)

Fix data race by using a state pool

58930f9

Signed-off-by: Arthur Silva Sens <[email protected]>

ArthurSens requested review from dashpole and a team as code owners November 29, 2024 13:07

github-actions bot assigned MovieStoreGuy Nov 29, 2024

github-actions bot added the exporter/prometheusremotewrite label Nov 29, 2024

github-actions bot requested review from Aneurysm9 and rapphil November 29, 2024 13:07

This was referenced Nov 29, 2024

[exporter/prometheusremotewrite] Fix data race for batch state by removing the state altogether #36600

Closed

[exporter/prometheusremotewrite] Fix data race in batch series state if called concurrently #36524

Closed

Increase data used in benchmark

c0329a7

Signed-off-by: Arthur Silva Sens <[email protected]> (cherry picked from commit 2dc1870)

Update benchmark to create more diverse timeseries

dab33e5

Signed-off-by: Arthur Silva Sens <[email protected]> (cherry picked from commit b5a9c1d)

ArthurSens added 2 commits December 13, 2024 16:32

Add changelog

40fe165

Signed-off-by: Arthur Silva Sens <[email protected]>

lint

cc20caf

Signed-off-by: Arthur Silva Sens <[email protected]>

dashpole approved these changes Dec 16, 2024

View reviewed changes

ArthurSens mentioned this pull request Dec 16, 2024

feat/re-allow multiple workers #36134

Open

Aneurysm9 approved these changes Dec 16, 2024

View reviewed changes

dashpole added the ready to merge Code review completed; ready to merge by maintainers label Dec 16, 2024

This was referenced Dec 17, 2024

Weekly Report: 2024-12-10 - 2024-12-17 #36867

Open

Weekly Report: 2024-12-17 - 2024-12-24 #36929

Open

bogdandrutu merged commit 33c5306 into open-telemetry:main Dec 26, 2024
170 of 172 checks passed

github-actions bot added this to the next release milestone Dec 26, 2024

ArthurSens deleted the prwexporter-batch-statepool branch December 27, 2024 12:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[exporter]prometheusremotewrite] Fix data race by introducing pool of batch state #36601

[exporter]prometheusremotewrite] Fix data race by introducing pool of batch state #36601

ArthurSens commented Nov 29, 2024 •

edited

Loading

Aneurysm9 commented Dec 1, 2024

ArthurSens commented Dec 2, 2024

dashpole commented Dec 2, 2024

ArthurSens commented Dec 5, 2024

ArthurSens commented Dec 13, 2024

ArthurSens commented Dec 13, 2024

ArthurSens commented Dec 16, 2024

[exporter]prometheusremotewrite] Fix data race by introducing pool of batch state #36601

[exporter]prometheusremotewrite] Fix data race by introducing pool of batch state #36601

Conversation

ArthurSens commented Nov 29, 2024 • edited Loading

Description

Benchmark results

Aneurysm9 commented Dec 1, 2024

ArthurSens commented Dec 2, 2024

dashpole commented Dec 2, 2024

ArthurSens commented Dec 5, 2024

ArthurSens commented Dec 13, 2024

ArthurSens commented Dec 13, 2024

ArthurSens commented Dec 16, 2024

ArthurSens commented Nov 29, 2024 •

edited

Loading