Refactor FeatureFlags #17611

finnegancarroll · 2025-03-17T18:45:03Z

Description

There are a handful of competing ways to set feature flags today.

FeatureFlags.initializeFeatureFlags
Takes a setting object and overwrites FeatureFlags.settings. Node invokes this to set flags on startup, but it is also used throughout unit tests to dynamically update and clear feature flags. Provides no way to set a single flag without rewriting all settings.
System.setProperty:
Sets the feature flag as a JVM property. Calls to FeatureFlags.isEnabled() will check System.getProperty() and see this flag. Allows dynamic setting of individual feature flags.
FeatureFlagSetter.set:
A small wrapper around the above System.setProperty() which additionally tracks which flags it sets. Used only in tests. Tracks set properties and exposes clearAll() to unset all tracked properties which is invoked during OpenSearchTestCase teardown to ensure a "clean slate" for each test.

Together these implementations provide the desired functionality for feature flags across test and non-test use cases but present some issues:

System properties persist for the lifetime of the JVM and can carry over to other tests run on the same JVM, polluting other test environments if not cleared.
System.getProperty() can be slow and have performance implications when feature flags are retrieved in a hot path.
When not using System.setProperty(), clearing or setting any single feature flag resets all flags.

This PR refactors FeatureFlags to consolidate feature flag functionality and remove usage of JVM system properties in all cases except initializeFeatureFlags which is called once by Node on startup.

Specific changes include:

Replaces FeatureFlags immutable internal settings with a map.
initializeFeatureFlags now reads and sets feature flags from JVM system properties.
isEnabled no longer checks JVM system properties to mitigate performance impact.
For test use cases, functionality for setting/unsetting flags is provided behind FeatureFlags.TestUtils for clarity.
- @LockFeatureFlag annotation provided to set the value of a feature flag and make it immutable for the duration of the test case.
- FeatureFlags.TestUtils.with for execution of a single runnable with feature flag enabled.
- FeatureFlags.TestUtils.FlagWriteLock for directly accessing a feature flag's write lock.

Related Issues

Resolves #16519

Check List

Functionality includes testing.
API changes companion pull request created, if applicable.
Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

github-actions · 2025-03-17T19:03:07Z

❌ Gradle check result for fdec16b: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2025-03-17T19:37:56Z

❌ Gradle check result for f1d5490: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2025-03-17T19:56:32Z

❌ Gradle check result for 1228bda: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2025-03-17T20:26:56Z

❌ Gradle check result for 4a3e60f: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

finnegancarroll · 2025-03-17T20:48:23Z

Verifying isEnabled is represented less in big5 term flame graph per #16519.

Feature branch:

Main branch:

github-actions · 2025-03-18T20:57:01Z

❌ Gradle check result for 4bc33cd: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2025-03-18T21:20:33Z

❌ Gradle check result for 94fd7af: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2025-03-18T22:50:18Z

❌ Gradle check result for e8ad6be: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

finnegancarroll · 2025-03-18T22:53:06Z

{"run-benchmark-test": "id_5"}

github-actions · 2025-03-18T23:08:33Z

The Jenkins job url is https://build.ci.opensearch.org/job/benchmark-pull-request/2624/ . Final results will be published once the job is completed.

finnegancarroll · 2025-03-18T23:40:40Z

{"run-benchmark-test": "id_3"}

opensearch-ci-bot · 2025-03-18T23:53:27Z

Benchmark Results

Benchmark Results for Job: https://build.ci.opensearch.org/job/benchmark-pull-request/2624/

Metric	Task	Value	Unit
Cumulative indexing time of primary shards		69.6455	min
Min cumulative indexing time across primary shards		69.6455	min
Median cumulative indexing time across primary shards		69.6455	min
Max cumulative indexing time across primary shards		69.6455	min
Cumulative indexing throttle time of primary shards		0	min
Min cumulative indexing throttle time across primary shards		0	min
Median cumulative indexing throttle time across primary shards		0	min
Max cumulative indexing throttle time across primary shards		0	min
Cumulative merge time of primary shards		39.4971	min
Cumulative merge count of primary shards		50
Min cumulative merge time across primary shards		39.4971	min
Median cumulative merge time across primary shards		39.4971	min
Max cumulative merge time across primary shards		39.4971	min
Cumulative merge throttle time of primary shards		10.1662	min
Min cumulative merge throttle time across primary shards		10.1662	min
Median cumulative merge throttle time across primary shards		10.1662	min
Max cumulative merge throttle time across primary shards		10.1662	min
Cumulative refresh time of primary shards		1.24378	min
Cumulative refresh count of primary shards		73
Min cumulative refresh time across primary shards		1.24378	min
Median cumulative refresh time across primary shards		1.24378	min
Max cumulative refresh time across primary shards		1.24378	min
Cumulative flush time of primary shards		5.24718	min
Cumulative flush count of primary shards		55
Min cumulative flush time across primary shards		5.24718	min
Median cumulative flush time across primary shards		5.24718	min
Max cumulative flush time across primary shards		5.24718	min
Total Young Gen GC time		1.213	s
Total Young Gen GC count		45
Total Old Gen GC time		0	s
Total Old Gen GC count		0
Store size		19.3685	GB
Translog size		5.12227e-08	GB
Heap used for segments		0	MB
Heap used for doc values		0	MB
Heap used for terms		0	MB
Heap used for norms		0	MB
Heap used for points		0	MB
Heap used for stored fields		0	MB
Segment count		33
Min Throughput	index-append	533.09	docs/s
Mean Throughput	index-append	555.14	docs/s
Median Throughput	index-append	553.5	docs/s
Max Throughput	index-append	576.99	docs/s
50th percentile latency	index-append	6682.17	ms
90th percentile latency	index-append	11185.8	ms
99th percentile latency	index-append	14529.5	ms
100th percentile latency	index-append	15045.7	ms
50th percentile service time	index-append	6679.34	ms
90th percentile service time	index-append	11185.8	ms
99th percentile service time	index-append	14529.5	ms
100th percentile service time	index-append	15045.7	ms
error rate	index-append	0	%
Min Throughput	wait-until-merges-finish	0	ops/s
Mean Throughput	wait-until-merges-finish	0	ops/s
Median Throughput	wait-until-merges-finish	0	ops/s
Max Throughput	wait-until-merges-finish	0	ops/s
100th percentile latency	wait-until-merges-finish	224855	ms
100th percentile service time	wait-until-merges-finish	224855	ms
error rate	wait-until-merges-finish	0	%
Min Throughput	default	19.95	ops/s
Mean Throughput	default	19.96	ops/s
Median Throughput	default	19.96	ops/s
Max Throughput	default	19.96	ops/s
50th percentile latency	default	5.60398	ms
90th percentile latency	default	6.0789	ms
99th percentile latency	default	7.24975	ms
100th percentile latency	default	8.27455	ms
50th percentile service time	default	4.74304	ms
90th percentile service time	default	4.98108	ms
99th percentile service time	default	6.21427	ms
100th percentile service time	default	7.28716	ms
error rate	default	0	%
Min Throughput	term	19.99	ops/s
Mean Throughput	term	19.99	ops/s
Median Throughput	term	19.99	ops/s
Max Throughput	term	19.99	ops/s
50th percentile latency	term	6.76375	ms
90th percentile latency	term	7.18768	ms
99th percentile latency	term	8.16676	ms
100th percentile latency	term	8.65761	ms
50th percentile service time	term	5.99482	ms
90th percentile service time	term	6.16012	ms
99th percentile service time	term	7.63441	ms
100th percentile service time	term	7.87623	ms
error rate	term	0	%
Min Throughput	phrase	19.97	ops/s
Mean Throughput	phrase	19.98	ops/s
Median Throughput	phrase	19.98	ops/s
Max Throughput	phrase	19.98	ops/s
50th percentile latency	phrase	7.70236	ms
90th percentile latency	phrase	8.05669	ms
99th percentile latency	phrase	8.66324	ms
100th percentile latency	phrase	10.4943	ms
50th percentile service time	phrase	6.86142	ms
90th percentile service time	phrase	7.01882	ms
99th percentile service time	phrase	7.67354	ms
100th percentile service time	phrase	9.24893	ms
error rate	phrase	0	%
Min Throughput	articles_monthly_agg_uncached	19.9	ops/s
Mean Throughput	articles_monthly_agg_uncached	19.92	ops/s
Median Throughput	articles_monthly_agg_uncached	19.92	ops/s
Max Throughput	articles_monthly_agg_uncached	19.93	ops/s
50th percentile latency	articles_monthly_agg_uncached	9.26923	ms
90th percentile latency	articles_monthly_agg_uncached	9.84287	ms
99th percentile latency	articles_monthly_agg_uncached	12.0116	ms
100th percentile latency	articles_monthly_agg_uncached	14.2108	ms
50th percentile service time	articles_monthly_agg_uncached	8.53063	ms
90th percentile service time	articles_monthly_agg_uncached	8.82642	ms
99th percentile service time	articles_monthly_agg_uncached	11.6331	ms
100th percentile service time	articles_monthly_agg_uncached	13.4582	ms
error rate	articles_monthly_agg_uncached	0	%
Min Throughput	articles_monthly_agg_cached	20.02	ops/s
Mean Throughput	articles_monthly_agg_cached	20.02	ops/s
Median Throughput	articles_monthly_agg_cached	20.02	ops/s
Max Throughput	articles_monthly_agg_cached	20.02	ops/s
50th percentile latency	articles_monthly_agg_cached	3.84179	ms
90th percentile latency	articles_monthly_agg_cached	4.28907	ms
99th percentile latency	articles_monthly_agg_cached	4.66468	ms
100th percentile latency	articles_monthly_agg_cached	4.84658	ms
50th percentile service time	articles_monthly_agg_cached	3.06734	ms
90th percentile service time	articles_monthly_agg_cached	3.19375	ms
99th percentile service time	articles_monthly_agg_cached	3.73989	ms
100th percentile service time	articles_monthly_agg_cached	3.80996	ms
error rate	articles_monthly_agg_cached	0	%
Min Throughput	scroll	12.55	pages/s
Mean Throughput	scroll	12.58	pages/s
Median Throughput	scroll	12.57	pages/s
Max Throughput	scroll	12.64	pages/s
50th percentile latency	scroll	685.659	ms
90th percentile latency	scroll	690.486	ms
99th percentile latency	scroll	723.053	ms
100th percentile latency	scroll	726.376	ms
50th percentile service time	scroll	683.41	ms
90th percentile service time	scroll	687.764	ms
99th percentile service time	scroll	720.837	ms
100th percentile service time	scroll	723.685	ms
error rate	scroll	0	%

opensearch-ci-bot · 2025-03-18T23:54:22Z

Benchmark Baseline Comparison Results

Benchmark Results for Job: https://build.ci.opensearch.org/job/benchmark-compare/40/

Metric	Task	Baseline	Contender	Diff	Unit
Cumulative indexing time of primary shards		66.5823	69.6455	3.06325	min
Min cumulative indexing time across primary shard		66.5823	69.6455	3.06325	min
Median cumulative indexing time across primary shard		66.5823	69.6455	3.06325	min
Max cumulative indexing time across primary shard		66.5823	69.6455	3.06325	min
Cumulative indexing throttle time of primary shards		0	0	0	min
Min cumulative indexing throttle time across primary shard		0	0	0	min
Median cumulative indexing throttle time across primary shard		0	0	0	min
Max cumulative indexing throttle time across primary shard		0	0	0	min
Cumulative merge time of primary shards		36.9144	39.4971	2.58272	min
Cumulative merge count of primary shards		50	50	0
Min cumulative merge time across primary shard		36.9144	39.4971	2.58272	min
Median cumulative merge time across primary shard		36.9144	39.4971	2.58272	min
Max cumulative merge time across primary shard		36.9144	39.4971	2.58272	min
Cumulative merge throttle time of primary shards		9.61303	10.1662	0.55318	min
Min cumulative merge throttle time across primary shard		9.61303	10.1662	0.55318	min
Median cumulative merge throttle time across primary shard		9.61303	10.1662	0.55318	min
Max cumulative merge throttle time across primary shard		9.61303	10.1662	0.55318	min
Cumulative refresh time of primary shards		1.30153	1.24378	-0.05775	min
Cumulative refresh count of primary shards		73	73	0
Min cumulative refresh time across primary shard		1.30153	1.24378	-0.05775	min
Median cumulative refresh time across primary shard		1.30153	1.24378	-0.05775	min
Max cumulative refresh time across primary shard		1.30153	1.24378	-0.05775	min
Cumulative flush time of primary shards		5.02535	5.24718	0.22183	min
Cumulative flush count of primary shards		56	55	-1
Min cumulative flush time across primary shard		5.02535	5.24718	0.22183	min
Median cumulative flush time across primary shard		5.02535	5.24718	0.22183	min
Max cumulative flush time across primary shard		5.02535	5.24718	0.22183	min
Total Young Gen GC time		1.268	1.213	-0.055	s
Total Young Gen GC count		44	45	1
Total Old Gen GC time		0	0	0	s
Total Old Gen GC count		0	0	0
Store size		19.3661	19.3685	0.0024	GB
Translog size		5.12227e-08	5.12227e-08	0	GB
Heap used for segments		0	0	0	MB
Heap used for doc values		0	0	0	MB
Heap used for terms		0	0	0	MB
Heap used for norms		0	0	0	MB
Heap used for points		0	0	0	MB
Heap used for stored fields		0	0	0	MB
Segment count		35	33	-2
Min Throughput	index-append	556.954	533.093	-23.8608	docs/s
Mean Throughput	index-append	573.226	555.145	-18.0815	docs/s
Median Throughput	index-append	570.98	553.497	-17.4832	docs/s
Max Throughput	index-append	591.414	576.988	-14.426	docs/s
50th percentile latency	index-append	6485.8	6682.17	196.374	ms
90th percentile latency	index-append	10736	11185.8	449.817	ms
99th percentile latency	index-append	13461	14529.5	1068.53	ms
100th percentile latency	index-append	14951.6	15045.7	94.0654	ms
50th percentile service time	index-append	6486.85	6679.34	192.495	ms
90th percentile service time	index-append	10731.4	11185.8	454.37	ms
99th percentile service time	index-append	13461	14529.5	1068.53	ms
100th percentile service time	index-append	14951.6	15045.7	94.0654	ms
error rate	index-append	0	0	0	%
Min Throughput	wait-until-merges-finish	0.00548835	0.0044473	-0.00104	ops/s
Mean Throughput	wait-until-merges-finish	0.00548835	0.0044473	-0.00104	ops/s
Median Throughput	wait-until-merges-finish	0.00548835	0.0044473	-0.00104	ops/s
Max Throughput	wait-until-merges-finish	0.00548835	0.0044473	-0.00104	ops/s
100th percentile latency	wait-until-merges-finish	182204	224855	42651.3	ms
100th percentile service time	wait-until-merges-finish	182204	224855	42651.3	ms
error rate	wait-until-merges-finish	0	0	0	%
Min Throughput	default	19.9525	19.9481	-0.00438	ops/s
Mean Throughput	default	19.9591	19.9554	-0.00371	ops/s
Median Throughput	default	19.9594	19.9559	-0.00349	ops/s
Max Throughput	default	19.9645	19.9613	-0.00321	ops/s
50th percentile latency	default	5.20545	5.60398	0.39853	ms
90th percentile latency	default	5.7976	6.0789	0.2813	ms
99th percentile latency	default	6.44605	7.24975	0.8037	ms
100th percentile latency	default	9.36473	8.27455	-1.09018	ms
50th percentile service time	default	4.52042	4.74304	0.22263	ms
90th percentile service time	default	4.84029	4.98108	0.1408	ms
99th percentile service time	default	5.58834	6.21427	0.62593	ms
100th percentile service time	default	8.91136	7.28716	-1.6242	ms
error rate	default	0	0	0	%
Min Throughput	term	19.9619	19.9875	0.02559	ops/s
Mean Throughput	term	19.9673	19.9892	0.02188	ops/s
Median Throughput	term	19.9674	19.9891	0.02171	ops/s
Max Throughput	term	19.972	19.9906	0.01863	ops/s
50th percentile latency	term	6.53014	6.76375	0.23362	ms
90th percentile latency	term	7.08096	7.18768	0.10673	ms
99th percentile latency	term	8.39588	8.16676	-0.22913	ms
100th percentile latency	term	8.5744	8.65761	0.08321	ms
50th percentile service time	term	5.76724	5.99482	0.22757	ms
90th percentile service time	term	6.17798	6.16012	-0.01786	ms
99th percentile service time	term	7.41239	7.63441	0.22201	ms
100th percentile service time	term	7.57209	7.87623	0.30414	ms
error rate	term	0	0	0	%
Min Throughput	phrase	19.9522	19.9724	0.02026	ops/s
Mean Throughput	phrase	19.9593	19.9766	0.01729	ops/s
Median Throughput	phrase	19.9598	19.9771	0.01733	ops/s
Max Throughput	phrase	19.9656	19.98	0.01437	ops/s
50th percentile latency	phrase	7.13969	7.70236	0.56267	ms
90th percentile latency	phrase	7.85193	8.05669	0.20476	ms
99th percentile latency	phrase	9.59321	8.66324	-0.92997	ms
100th percentile latency	phrase	9.82166	10.4943	0.6726	ms
50th percentile service time	phrase	6.35502	6.86142	0.5064	ms
90th percentile service time	phrase	6.89474	7.01882	0.12408	ms
99th percentile service time	phrase	8.78502	7.67354	-1.11149	ms
100th percentile service time	phrase	9.25579	9.24893	-0.00686	ms
error rate	phrase	0	0	0	%
Min Throughput	articles_monthly_agg_uncached	19.8827	19.903	0.02034	ops/s
Mean Throughput	articles_monthly_agg_uncached	19.8993	19.9165	0.01725	ops/s
Median Throughput	articles_monthly_agg_uncached	19.9002	19.9174	0.01721	ops/s
Max Throughput	articles_monthly_agg_uncached	19.9128	19.9277	0.01487	ops/s
50th percentile latency	articles_monthly_agg_uncached	8.42931	9.26923	0.83992	ms
90th percentile latency	articles_monthly_agg_uncached	9.01449	9.84287	0.82838	ms
99th percentile latency	articles_monthly_agg_uncached	11.3613	12.0116	0.65035	ms
100th percentile latency	articles_monthly_agg_uncached	12.872	14.2108	1.33882	ms
50th percentile service time	articles_monthly_agg_uncached	7.70701	8.53063	0.82363	ms
90th percentile service time	articles_monthly_agg_uncached	8.14383	8.82642	0.6826	ms
99th percentile service time	articles_monthly_agg_uncached	10.7346	11.6331	0.89856	ms
100th percentile service time	articles_monthly_agg_uncached	12.6251	13.4582	0.83304	ms
error rate	articles_monthly_agg_uncached	0	0	0	%
Min Throughput	articles_monthly_agg_cached	20.018	20.0165	-0.00152	ops/s
Mean Throughput	articles_monthly_agg_cached	20.0211	20.0189	-0.00228	ops/s
Median Throughput	articles_monthly_agg_cached	20.021	20.0184	-0.00256	ops/s
Max Throughput	articles_monthly_agg_cached	20.0245	20.0222	-0.00225	ops/s
50th percentile latency	articles_monthly_agg_cached	3.02567	3.84179	0.81611	ms
90th percentile latency	articles_monthly_agg_cached	3.51716	4.28907	0.77191	ms
99th percentile latency	articles_monthly_agg_cached	3.85585	4.66468	0.80883	ms
100th percentile latency	articles_monthly_agg_cached	4.02648	4.84658	0.8201	ms
50th percentile service time	articles_monthly_agg_cached	2.25914	3.06734	0.8082	ms
90th percentile service time	articles_monthly_agg_cached	2.61932	3.19375	0.57443	ms
99th percentile service time	articles_monthly_agg_cached	2.80007	3.73989	0.93981	ms
100th percentile service time	articles_monthly_agg_cached	2.83951	3.80996	0.97045	ms
error rate	articles_monthly_agg_cached	0	0	0	%
Min Throughput	scroll	12.5463	12.548	0.00167	pages/s
Mean Throughput	scroll	12.5761	12.5789	0.00281	pages/s
Median Throughput	scroll	12.5693	12.5718	0.00254	pages/s
Max Throughput	scroll	12.6372	12.6423	0.00509	pages/s
50th percentile latency	scroll	666.024	685.659	19.6352	ms
90th percentile latency	scroll	681.993	690.486	8.49335	ms
99th percentile latency	scroll	778.473	723.053	-55.4203	ms
100th percentile latency	scroll	786.847	726.376	-60.4711	ms
50th percentile service time	scroll	663.62	683.41	19.79	ms
90th percentile service time	scroll	679.785	687.764	7.97818	ms
99th percentile service time	scroll	776.09	720.837	-55.2531	ms
100th percentile service time	scroll	784.191	723.685	-60.5066	ms
error rate	scroll	0	0	0	%

rishabh6788 · 2025-03-19T02:23:49Z

{"run-benchmark-test": "id_3"}

github-actions · 2025-03-20T00:03:51Z

✅ Gradle check result for e8ad6be: SUCCESS

codecov · 2025-03-20T00:04:12Z

Codecov Report

Attention: Patch coverage is 92.50000% with 6 lines in your changes missing coverage. Please review.

Project coverage is 72.43%. Comparing base (6d53f9d) to head (7b7f927).
Report is 3 commits behind head on main.

Files with missing lines	Patch %	Lines
.../java/org/opensearch/common/util/FeatureFlags.java	92.50%	5 Missing and 1 partial ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##               main   #17611      +/-   ##
============================================
+ Coverage     72.40%   72.43%   +0.03%     
+ Complexity    65828    65823       -5     
============================================
  Files          5316     5316              
  Lines        305294   305395     +101     
  Branches      44289    44303      +14     
============================================
+ Hits         221033   221208     +175     
+ Misses        66187    66112      -75     
- Partials      18074    18075       +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

finnegancarroll · 2025-03-20T00:10:54Z

distance_amount_agg is just barely showing a ~5% regression in 90+ percentile latency.
#17611 (comment)

msfroh

This is a great improvement! Using annotations is much easier (and more consistent) for developers testing their code.

CHANGELOG.md

github-actions · 2025-03-20T18:55:42Z

✅ Gradle check result for 10cdbec: SUCCESS

jainankitk

@finnegancarroll - Few nitpicks, but otherwise this looks great!

server/src/main/java/org/opensearch/common/util/FeatureFlags.java

jainankitk · 2025-03-20T19:57:56Z

server/src/main/java/org/opensearch/common/util/FeatureFlags.java

+            for (Setting<Boolean> ff : featureFlags.keySet()) {
+                if (ff.getKey().equals(featureFlagName)) featureFlags.put(ff, value);


Initially, I was confused why we need to iterate over the keys for putting featureFlag, then I noticed need to compare the setting name, within Setting object that is the key. I am assuming this is done only during the initialization?

I think Ideally feature flags would be represented as a Setting everywhere. However, since annotations only take primitive types I ended up with this pattern where I look up the feature flag from its setting key.

andrross · 2025-03-20T20:54:04Z

server/src/main/java/org/opensearch/common/util/FeatureFlags.java

+    }
+
+    /**
+     * Provides feature flag write access and synchronization for test use cases.


Copying from this comment from @reta about how tests are run:

So this is my understanding of how we do that:

we run test suites concurrently by forking JVMs

we never run test suites (or tests with same suite) concurrently within same JVM

The static state between tests should not be shared between JVMs but it is possible that we don't clean up properly after some test suites so this problem emerges.

As far as I know there should be no contention between tests modifying feature flags in the static state because within one JVM there is only ever one test running at one time. That means the concurrency primitives here (ReentrantLocks) should not be needed. I'd be in favor of not using locks if they are not necessary because it will only add to confusion about how tests are run.

I see. In hindsight that makes sense as otherwise OpenSearchTestCase resetting all flags every test would make feature flags extremely unstable. I notice that if I remove the FlagLock class completely I run into issues as test cases sometimes initialize new nodes which clears all previous feature flag settings. This breaks the annotation since it sets the flag before test setup.

I've moved the FlagLock class to just make the flag immutable for its lifetime to preserve the annotation functionality. Let me know if this makes sense or if it would be more appropriate to drop the annotation & FlagLock class entirely.

github-actions · 2025-03-23T22:57:11Z

❌ Gradle check result for af94175: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

- Remove internal immutable `settings` in favor of `ConcurrentHashMap`. - Move functionality to internal `FeatureFlagsImpl` class. Expose public api of `FeatureFlagsImpl` in FeatureFlags. Expose test api of `FeatureFlagsImpl` in FeatureFlags.TestUtils. - Read and set JVM system properties once on `initializeFeatureFlags`. Remove JVM system properties check from `isEnabled`. - Add `FlagLock` in `TestUtils` to maintain a lock for each feature flag. - Add helper functions to set & access feature flags in a thread safe way. `TestUtils.with(<feature flag>, () -> {})` to execute crtical sections. `New FlagLock(<feature flag>)` for fine grained control. Signed-off-by: Finn Carroll <carrofin@amazon.com>

- Add annotation in OpenSearchTestCase to enable and lock a flag for the duration of a single test case. Signed-off-by: Finn Carroll <carrofin@amazon.com>

- Add cases for public api. - Add cases for thread safe helpers @LockFeatureFlag FlagLock TestUtils.with Signed-off-by: Finn Carroll <carrofin@amazon.com>

Signed-off-by: Finn Carroll <carrofin@amazon.com>

Replace all usages of `FeatureFlagSetter` in tests. Replace all usages of JVM system properties for feature flags in tests. Replace all usages of `initializeFeatureFlags` with `TestUtils.set` in tests. Signed-off-by: Finn Carroll <carrofin@amazon.com>

Signed-off-by: Finn Carroll <carrofin@amazon.com>

- Add missing LockFeatureFlag annotations. - Cannot use annotation in tests which expect exception thrown. - SEARCHABLE_SNAPSHOT_EXTENDED_COMPATIBILITY has no setting? Adding. - Flight server tests need flag enabled on setup. Signed-off-by: Finn Carroll <carrofin@amazon.com>

JUnit does not run tests in parallel on the same JVM so these are not necessary. Additionally rename to `FlagWriteLock` for clarity. Signed-off-by: Finn Carroll <carrofin@amazon.com>

Address ff contant nit. Signed-off-by: Finn Carroll <carrofin@amazon.com>

Signed-off-by: Finn Carroll <carrofin@amazon.com>

github-actions · 2025-03-24T17:29:08Z

❌ Gradle check result for 7b7f927: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

finnegancarroll · 2025-03-24T17:58:53Z

ReactorNetty4StreamingStressIT is flaky (reached timeout)
#15840

github-actions · 2025-03-24T18:56:40Z

✅ Gradle check result for 7b7f927: SUCCESS

github-actions bot added bug Something isn't working good first issue Good for newcomers Search Search query, autocomplete ...etc v3.0.0 Issues and PRs related to version 3.0.0 labels Mar 17, 2025

finnegancarroll force-pushed the feature-flag-kinda-slow branch from fdec16b to f1d5490 Compare March 17, 2025 19:25

finnegancarroll force-pushed the feature-flag-kinda-slow branch from f1d5490 to 1228bda Compare March 17, 2025 19:44

finnegancarroll force-pushed the feature-flag-kinda-slow branch from 1228bda to b688424 Compare March 17, 2025 20:06

finnegancarroll force-pushed the feature-flag-kinda-slow branch from 4a3e60f to 4bc33cd Compare March 18, 2025 20:37

finnegancarroll force-pushed the feature-flag-kinda-slow branch from 4bc33cd to 94fd7af Compare March 18, 2025 21:06

finnegancarroll force-pushed the feature-flag-kinda-slow branch from feec9a7 to e8ad6be Compare March 18, 2025 21:52

finnegancarroll changed the title ~~Refactor feature flags for better test support~~ Refactor FeatureFlags Mar 18, 2025

github-actions bot mentioned this pull request Mar 18, 2025

Manual approval required for workflow run 13935021859: Request to approve/deny benchmark run for PR #17611 #17625

Closed

github-actions bot mentioned this pull request Mar 19, 2025

Manual approval required for workflow run 13937488390: Request to approve/deny benchmark run for PR #17611 #17629

Closed

finnegancarroll requested review from bugmakerrrrrr, jainankitk and linuxpi as code owners March 19, 2025 23:05

msfroh reviewed Mar 20, 2025

View reviewed changes

CHANGELOG.md Show resolved Hide resolved

msfroh approved these changes Mar 20, 2025

View reviewed changes

finnegancarroll force-pushed the feature-flag-kinda-slow branch from e8ad6be to 10cdbec Compare March 20, 2025 18:00

jainankitk reviewed Mar 20, 2025

View reviewed changes

andrross reviewed Mar 20, 2025

View reviewed changes

finnegancarroll added 9 commits March 24, 2025 09:15

Add @LockFeatureFlag annotation

6ca4e92

- Add annotation in OpenSearchTestCase to enable and lock a flag for the duration of a single test case. Signed-off-by: Finn Carroll <carrofin@amazon.com>

Update FeatureFlagTests

57115ef

- Add cases for public api. - Add cases for thread safe helpers @LockFeatureFlag FlagLock TestUtils.with Signed-off-by: Finn Carroll <carrofin@amazon.com>

Remove FeatureFlagSetter

5365a94

Signed-off-by: Finn Carroll <carrofin@amazon.com>

Add changelog entry

8882907

Signed-off-by: Finn Carroll <carrofin@amazon.com>

Remove concurrency primitives

c210e90

JUnit does not run tests in parallel on the same JVM so these are not necessary. Additionally rename to `FlagWriteLock` for clarity. Signed-off-by: Finn Carroll <carrofin@amazon.com>

Fix flight service IT.

9a35bf1

Address ff contant nit. Signed-off-by: Finn Carroll <carrofin@amazon.com>

finnegancarroll force-pushed the feature-flag-kinda-slow branch from 1bf0ba1 to 9a35bf1 Compare March 24, 2025 16:15

Nit.

7b7f927

Signed-off-by: Finn Carroll <carrofin@amazon.com>

finnegancarroll closed this Mar 24, 2025

finnegancarroll reopened this Mar 24, 2025

opensearch-ci-bot mentioned this pull request Mar 24, 2025

[AUTOCUT] Gradle Check Flaky Test Report for ReactorNetty4StreamingStressIT #15840

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor FeatureFlags #17611

Refactor FeatureFlags #17611

finnegancarroll commented Mar 17, 2025 •

edited

Loading

github-actions bot commented Mar 17, 2025

github-actions bot commented Mar 17, 2025

github-actions bot commented Mar 17, 2025

github-actions bot commented Mar 17, 2025

finnegancarroll commented Mar 17, 2025

github-actions bot commented Mar 18, 2025

github-actions bot commented Mar 18, 2025

github-actions bot commented Mar 18, 2025

finnegancarroll commented Mar 18, 2025

github-actions bot commented Mar 18, 2025

finnegancarroll commented Mar 18, 2025

opensearch-ci-bot commented Mar 18, 2025

Benchmark Results for Job: https://build.ci.opensearch.org/job/benchmark-pull-request/2624/

opensearch-ci-bot commented Mar 18, 2025

Benchmark Results for Job: https://build.ci.opensearch.org/job/benchmark-compare/40/

rishabh6788 commented Mar 19, 2025

github-actions bot commented Mar 20, 2025

codecov bot commented Mar 20, 2025 •

edited

Loading

finnegancarroll commented Mar 20, 2025

msfroh left a comment

github-actions bot commented Mar 20, 2025

jainankitk left a comment

jainankitk Mar 20, 2025

finnegancarroll Mar 24, 2025

andrross Mar 20, 2025

finnegancarroll Mar 23, 2025

github-actions bot commented Mar 23, 2025

github-actions bot commented Mar 24, 2025

finnegancarroll commented Mar 24, 2025

github-actions bot commented Mar 24, 2025

		for (Setting<Boolean> ff : featureFlags.keySet()) {
		if (ff.getKey().equals(featureFlagName)) featureFlags.put(ff, value);

Refactor FeatureFlags #17611

Are you sure you want to change the base?

Refactor FeatureFlags #17611

Conversation

finnegancarroll commented Mar 17, 2025 • edited Loading

Description

Related Issues

Check List

github-actions bot commented Mar 17, 2025

github-actions bot commented Mar 17, 2025

github-actions bot commented Mar 17, 2025

github-actions bot commented Mar 17, 2025

finnegancarroll commented Mar 17, 2025

github-actions bot commented Mar 18, 2025

github-actions bot commented Mar 18, 2025

github-actions bot commented Mar 18, 2025

finnegancarroll commented Mar 18, 2025

github-actions bot commented Mar 18, 2025

finnegancarroll commented Mar 18, 2025

opensearch-ci-bot commented Mar 18, 2025

Benchmark Results for Job: https://build.ci.opensearch.org/job/benchmark-pull-request/2624/

opensearch-ci-bot commented Mar 18, 2025

Benchmark Results for Job: https://build.ci.opensearch.org/job/benchmark-compare/40/

rishabh6788 commented Mar 19, 2025

github-actions bot commented Mar 20, 2025

codecov bot commented Mar 20, 2025 • edited Loading

Codecov Report

finnegancarroll commented Mar 20, 2025

msfroh left a comment

Choose a reason for hiding this comment

github-actions bot commented Mar 20, 2025

jainankitk left a comment

Choose a reason for hiding this comment

jainankitk Mar 20, 2025

Choose a reason for hiding this comment

finnegancarroll Mar 24, 2025

Choose a reason for hiding this comment

andrross Mar 20, 2025

Choose a reason for hiding this comment

finnegancarroll Mar 23, 2025

Choose a reason for hiding this comment

github-actions bot commented Mar 23, 2025

github-actions bot commented Mar 24, 2025

finnegancarroll commented Mar 24, 2025

github-actions bot commented Mar 24, 2025

finnegancarroll commented Mar 17, 2025 •

edited

Loading

codecov bot commented Mar 20, 2025 •

edited

Loading