Skip to content

Commit

Permalink
Add punet benchmarking to the regression suite (iree-org#19088)
Browse files Browse the repository at this point in the history
This commit adds support to benchmark punet fp16/fp8 performance TOM.
This concludes adding all the necessary testing for the SDXL model. It
also switches the compilation of punet to use the spec file as it is
necessary for tuning at the current state of the project. I've also
updated the artifacts in azure, and this time using the date as part of
the azure link so everyone knows the time the artifacts were generated.
Nithin is working on implementing the spec file optimizations as part of
the compiler itself, so we can remove the usage of such files in the
future. All tests have timeouts now too and updated existing ones
because the CLI flag seems to be per test timeouts (not the whole pytest
command itself). Now, we can avoid hangs such as
https://github.com/iree-org/iree/actions/runs/11748746984/job/32734141414

---------

Signed-off-by: saienduri <[email protected]>
Signed-off-by: saienduri <[email protected]>
Co-authored-by: saienduri <[email protected]>
  • Loading branch information
saienduri and saienduri authored Nov 13, 2024
1 parent cb5d1ab commit 43b22de
Show file tree
Hide file tree
Showing 5 changed files with 593 additions and 10 deletions.
14 changes: 11 additions & 3 deletions .github/workflows/pkgci_regression_test.yml
Original file line number Diff line number Diff line change
Expand Up @@ -112,7 +112,7 @@ jobs:
--no-skip-tests-missing-files \
--capture=no \
--log-cli-level=info \
--timeout=1200 \
--timeout=240 \
--durations=0 \
--config-files=${MODELS_CONFIG_FILE_PATH}
Expand Down Expand Up @@ -189,7 +189,7 @@ jobs:
-rpfE \
--capture=no \
--log-cli-level=info \
--timeout=1200 \
--timeout=240 \
--durations=0
env:
ROCM_CHIP: ${{ matrix.rocm-chip }}
Expand All @@ -203,7 +203,7 @@ jobs:
-rpfE \
--capture=no \
--log-cli-level=info \
--timeout=1200 \
--timeout=240 \
--durations=0
env:
ROCM_CHIP: ${{ matrix.rocm-chip }}
Expand All @@ -227,6 +227,7 @@ jobs:
--goldensize-rocm-clip-bytes 860000 \
--goldensize-rocm-vae-bytes 840000 \
--rocm-chip gfx90a \
--timeout=240 \
--log-cli-level=info \
--retries 7
echo "$(<job_summary.md )" >> $GITHUB_STEP_SUMMARY
Expand All @@ -247,7 +248,14 @@ jobs:
--goldensize-rocm-unet-bytes 2270000 \
--goldensize-rocm-clip-bytes 860000 \
--goldensize-rocm-vae-bytes 840000 \
--goldentime-rocm-punet-int8-fp16-ms 55 \
--goldendispatch-rocm-punet-int8-fp16 1284 \
--goldensize-rocm-punet-int8-fp16-bytes 2560000 \
--goldentime-rocm-punet-int8-fp8-ms 59 \
--goldendispatch-rocm-punet-int8-fp8 1564 \
--goldensize-rocm-punet-int8-fp8-bytes 2800000 \
--rocm-chip gfx942 \
--log-cli-level=info \
--timeout=240 \
--retries 7
echo "$(<job_summary.md )" >> $GITHUB_STEP_SUMMARY
Loading

0 comments on commit 43b22de

Please sign in to comment.