
Our perf thresholds have changed #178

Merged · 4 commits merged into apple:main from cb-fix-performance-thresholds on Jun 7, 2024

Conversation

@Lukasa (Contributor) commented Jun 7, 2024

No description provided.

@Lukasa added the semver/none (No version bump required.) label on Jun 7, 2024
@Lukasa requested a review from dnadoba on Jun 7, 2024 at 14:57
@Lukasa (Contributor, Author) commented Jun 7, 2024

The retain/release counts in sync code have become very unstable, and I don't think they're providing us much value in comparison to the time I'm spending trying to dial them in. So I propose we just remove them.

@dnadoba (Member) commented Jun 7, 2024

Fair! If they are not stable, even in sync code, we need to disable them. It is surprising that this now affects sync code. @hassila, are there any known issues?
I'm only aware of ordo-one/package-benchmark#189, but I don't think that should make it non-deterministic.
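For reference, "disabling them" in a package-benchmark suite amounts to leaving the ARC counters out of the configured metrics. A minimal sketch, assuming a package-benchmark benchmark target; the benchmark name and workload below are placeholders, not code from this PR:

```swift
import Benchmark

// Placeholder standing in for the real sync request/response workload.
func runSyncWorkload() -> Int {
    (0..<64).reduce(0, +)
}

let benchmarks = {
    // Hypothetical benchmark: track allocations and syscalls, but omit
    // .retainCount / .releaseCount so no ARC thresholds are recorded or checked.
    Benchmark(
        "SyncRequestResponse",
        configuration: .init(metrics: [.mallocCountTotal, .syscalls])
    ) { benchmark in
        for _ in benchmark.scaledIterations {
            blackHole(runSyncWorkload())
        }
    }
}
```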

Review comment on lines 5 to 6:

    "releaseCount" : 120271,
    "retainCount" : 109425,

Member commented:

Can we remove the retain/release counts from the output as well please?

Contributor Author replied:

Done.
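For context, after this change a per-benchmark threshold file would simply omit the two ARC keys. A sketch of the shape, with placeholder values rather than the actual numbers from this PR:

```json
{
  "mallocCountTotal" : 1000,
  "syscalls" : 50
}
```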

@Lukasa merged commit 4688f24 into apple:main on Jun 7, 2024
6 checks passed
@Lukasa deleted the cb-fix-performance-thresholds branch on Jun 7, 2024 at 15:27
@hassila commented Jun 7, 2024

Nothing that I know of should make them unstable (possibly if you run any sampling metric like thread counts etc.? If you run with only ARC metrics enabled, is it still unstable?). Then of course they don't always add up, as mentioned in the related issue: some objects are also created with an initial ref count, which is another reason we sometimes see more releases.

@Lukasa (Contributor, Author) commented Jun 10, 2024

I don't think we had any sampling metrics enabled: we recorded syscalls, allocations, and retains/releases. And the issue wasn't an adding-up problem: it was that the values changed from run-to-run.

@hassila commented Jun 10, 2024

Yeah, that's very strange then. That they don't add up is known, but they should be stable.

@hassila commented Jun 10, 2024

(To isolate it, you might try running with only ARC metrics enabled; then I can't really see how any other part of the benchmark infrastructure could affect it much.)
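A minimal sketch of that isolation run, again assuming package-benchmark; only the ARC counters are configured, so nothing else in the harness is being sampled (names and workload are placeholders):

```swift
import Benchmark

// Placeholder standing in for the real sync workload.
func runSyncWorkload() -> Int {
    (0..<64).reduce(0, +)
}

let benchmarks = {
    // Hypothetical isolation run: collect only the ARC counters, so any
    // remaining run-to-run variance cannot come from other metrics.
    Benchmark(
        "SyncRequestResponse-ARCOnly",
        configuration: .init(metrics: [.retainCount, .releaseCount])
    ) { benchmark in
        for _ in benchmark.scaledIterations {
            blackHole(runSyncWorkload())
        }
    }
}
```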

@dnadoba (Member) commented Aug 10, 2024

@Lukasa have you investigated why the thresholds have changed? How have you updated them?

I'm running ./dev/update-benchmark-thresholds locally, which produces stable output; I have just opened a PR with it: #184.
However, CI doesn't agree with those numbers, even though the values CI produces also seem to be stable.
Docker is used both locally and on CI, so I wouldn't expect any difference.

@Lukasa (Contributor, Author) commented Aug 12, 2024

My current theory is that because we aren't scaling the iterations, we're extremely sensitive to minor variations. We should have things settle down if we scale the iteration count by kilo or mega.
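A sketch of that scaling with package-benchmark's scalingFactor, assuming the inner loop is driven by scaledIterations (benchmark name and workload are placeholders):

```swift
import Benchmark

// Placeholder standing in for the real sync workload.
func runSyncWorkload() -> Int {
    (0..<64).reduce(0, +)
}

let benchmarks = {
    // Hypothetical: with .kilo the inner loop is scaled up, so each reported
    // value covers many iterations and small per-run variations average out.
    Benchmark(
        "SyncRequestResponse",
        configuration: .init(metrics: [.mallocCountTotal, .syscalls],
                             scalingFactor: .kilo)
    ) { benchmark in
        for _ in benchmark.scaledIterations {   // inner loop scaled by the configured factor
            blackHole(runSyncWorkload())
        }
    }
}
```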

Labels: semver/none (No version bump required.)
3 participants