Tracy captures produced by benchmark pipeline aren't grouping CPU codegen or GPU zones #7219

ScottTodd · 2021-09-30T16:51:40Z

_{some discussion on IREE's Discord here}

Examples traces can be downloaded from the artifacts tab on https://buildkite.com/iree/iree-benchmark/builds/1107#d697f531-34bf-4372-9942-6d373b8ece5f

Ungrouped CPU zone statistics:

Ungrouped GPU child zone statistics:

https://github.com/google/iree/blob/main/build_tools/benchmarks/run_benchmarks_on_android.py is the main script for running those benchmarks and collecting traces from them.

I tried to reproduce this on my Windows development machine with Tracy's capture GUI and CLI and an unrooted Samsung Galaxy S10 and was able to see grouped zones using both the CPU and GPU targets / HAL drivers:

ScottTodd · 2021-09-30T17:37:29Z

things left to check to get a repro:

the capture tool built + used for the pipeline on Linux
the lab phones
the python scripts used for the pipeline

antiagainst · 2021-10-02T18:50:19Z

I tried running the run_benchmarks_on_android.py script to benchmark on a local Android phone from a x86 Linux host. Everything works fine. Then I tried to on an aarch64 host (Raspberry Pi 4), it is not working. It's the same script, the same artifacts (benchmark suites, iree-benchmark-module), and the same phone. What's different is the Tracy capture tool. I have a tracy capture tool compiled for aarch64 for the latter case. In the lab we also use RPI4 to drive the phones.

So right now I suspect there are issues with Tracy capture tool compiled towards aarch64. Or maybe it's due to that we capture with a capture tool compiled for aarch64 and view the capture with a Tracy GUI tool compiled for x86? That's problematic?

I also tried Ben's wolfpld/tracy#262. That does not help either.

antiagainst · 2021-10-06T20:24:31Z

@benvanik: I don't know much internals about Tracy. So the above is more of my guess. Does it make sense?

benvanik · 2021-10-06T20:29:31Z

That's useful information! I was only trying to capture from an x86 host. Finding the right place to put some printfs that we can read back from the logs on the rpi would be useful.

antiagainst · 2022-07-28T21:52:35Z

So right now I suspect there are issues with Tracy capture tool compiled towards aarch64. Or maybe it's due to that we capture with a capture tool compiled for aarch64 and view the capture with a Tracy GUI tool compiled for x86? That's problematic?

With my Apple M1 macbook, I have both the capture tool and the profiler UI compiled for aarch64 and it works fine. But still I cannot use the profiler UI to open the captures generated from those RPI devices (which is also aarch64).. So I guess it might have something to do with the libraries Tracy depends on Ubuntu?

benvanik · 2022-07-28T22:15:52Z

I still have issues loading those android traces - I think I tracked it down to something that looked like undefined behavior somewhere in either the recording of string tables or the parsing of them but wasn't able to figure it out.

antiagainst · 2022-07-28T22:27:00Z

Yeah, me too. This bug is really wild.. Time to update Tracy though! It has been almost a quarter. :)

ScottTodd · 2023-12-15T22:55:55Z

I wonder if this reproduces on the latest Tracy / newer phones. This issue is quite old 🤔

ScottTodd · 2024-08-13T22:40:22Z

We switched from builtkite to github actions and then later dropped Tracy support from the benchmarks pipelines. Closing this old issue.

ScottTodd added bug 🐞 Something isn't working performance ⚡ Performance/optimization related work across the compiler and runtime infrastructure/benchmark Relating to benchmarking infrastructure labels Sep 30, 2021

GMNGeoffrey added the infrastructure Relating to build systems, CI, or testing label Dec 2, 2021

julianwa added this to IREE - ARCHIVED (do not update) Jun 2, 2022

GMNGeoffrey added this to IREE Jun 28, 2022

GMNGeoffrey added this to (Deprecated) IREE Feb 21, 2023

github-project-automation bot moved this to Not Started in (Deprecated) IREE Feb 21, 2023

allieculp moved this from Not Started to Backlog in (Deprecated) IREE May 19, 2023

ScottTodd closed this as not planned Won't fix, can't repro, duplicate, stale Aug 13, 2024

github-project-automation bot moved this to Done in IREE - ARCHIVED (do not update) Aug 13, 2024

github-project-automation bot moved this from Backlog to Done in (Deprecated) IREE Aug 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracy captures produced by benchmark pipeline aren't grouping CPU codegen or GPU zones #7219

Tracy captures produced by benchmark pipeline aren't grouping CPU codegen or GPU zones #7219

ScottTodd commented Sep 30, 2021 •

edited

Loading

ScottTodd commented Sep 30, 2021

antiagainst commented Oct 2, 2021

antiagainst commented Oct 6, 2021

benvanik commented Oct 6, 2021

antiagainst commented Jul 28, 2022

benvanik commented Jul 28, 2022

antiagainst commented Jul 28, 2022

ScottTodd commented Dec 15, 2023

ScottTodd commented Aug 13, 2024

Tracy captures produced by benchmark pipeline aren't grouping CPU codegen or GPU zones #7219

Tracy captures produced by benchmark pipeline aren't grouping CPU codegen or GPU zones #7219

Comments

ScottTodd commented Sep 30, 2021 • edited Loading

ScottTodd commented Sep 30, 2021

antiagainst commented Oct 2, 2021

antiagainst commented Oct 6, 2021

benvanik commented Oct 6, 2021

antiagainst commented Jul 28, 2022

benvanik commented Jul 28, 2022

antiagainst commented Jul 28, 2022

ScottTodd commented Dec 15, 2023

ScottTodd commented Aug 13, 2024

ScottTodd commented Sep 30, 2021 •

edited

Loading