Add Triton benchmarks for blog #509
base: main
Conversation
Signed-off-by: Rishi Chandra <[email protected]>
7c26cc0 to 2614369
1. [`spark_resnet.py`](spark_resnet.py): Uses predict_batch_udf to perform in-process prediction on the GPU.
2. [`spark_resnet_triton.py`](spark_resnet_triton.py): Uses predict_batch_udf to send inference requests to Triton, which performs inference on the GPU.
Spark cannot change the task parallelism within a stage based on the resources required (i.e., multiple CPUs for preprocessing vs. a single GPU for inference). Therefore, implementation (1) is limited to one task per GPU so that only one instance of the model runs on the GPU. In contrast, implementation (2) allows as many tasks to run in parallel as there are cores on the executor, since Triton handles inference on the GPU.
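The contrast above comes down to what the `predict_batch_udf` prediction function does: in implementation (1) it runs the model itself, while in implementation (2) it only forwards batches to a Triton server. A minimal sketch of the first pattern is below; the toy model that doubles its input is a hypothetical stand-in for loading ResNet-50 onto the GPU, and the commented-out Spark wiring assumes `pyspark.ml.functions.predict_batch_udf` (Spark >= 3.4):

```python
import numpy as np

def make_predict_fn():
    """Called once per Python worker to set up the model.

    In the real benchmark this would load ResNet-50 onto the GPU,
    which is why only one such task can run per GPU. Here a toy
    "model" that doubles its input stands in for the network.
    """
    def predict(inputs: np.ndarray) -> np.ndarray:
        # Receives a numpy batch, returns one prediction row per input row.
        return inputs * 2.0
    return predict

# With Spark available, the function above would be wrapped roughly as:
#
#   from pyspark.ml.functions import predict_batch_udf
#   from pyspark.sql.types import ArrayType, FloatType
#
#   classify = predict_batch_udf(make_predict_fn,
#                                return_type=ArrayType(FloatType()),
#                                batch_size=64)
#   df = df.withColumn("preds", classify("features"))
```

In the Triton variant, `predict` would instead send the batch to the Triton server over HTTP/gRPC, so the function holds no model state and many tasks can share one GPU-resident model.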
For ResNet-50, could multiple model instances fit on the GPU? If so, it might be good to benchmark that case, where multiple Spark tasks run per GPU, each with its own model instance. Since the processes would time-slice the GPU compute, performance could take a hit, but it would still be interesting to compare.
Would it make sense to consolidate this script with spark_resnet.py and select local inference or Triton via a CLI argument?
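The consolidation suggested above could be as simple as a small argparse switch; this is a hypothetical sketch (the flag name and choices are assumptions, not part of the PR):

```python
import argparse

def parse_backend(argv):
    """Parse a hypothetical --backend flag selecting the inference path."""
    parser = argparse.ArgumentParser(description="ResNet benchmark")
    parser.add_argument(
        "--backend",
        choices=["local", "triton"],
        default="local",
        help="run inference in-process (local) or via a Triton server",
    )
    return parser.parse_args(argv).backend
```

The script's main path could then pick the appropriate `make_predict_fn` based on the returned value.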
Scripts, configs, and instructions to reproduce the blog benchmarks, demonstrating the benefit of using Triton for CPU parallelism.