
Add operations/arguments to local CuPy array benchmark #524

Merged
merged 7 commits into rapidsai:branch-0.18 on Feb 12, 2021

Conversation

charlesbluca
Member

This PR adds the following operations to the local CuPy array benchmark (an illustrative sketch of how they might be dispatched follows the list):

  • sum
  • mean
  • array slicing

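Purely as an illustrative sketch (not the PR's exact code), the new operations could be dispatched on a persisted dask array of CuPy chunks roughly like this; the step of 3 in the slicing case matches the interval mentioned in the thoughts below:

```python
# Illustrative sketch only -- not the benchmark's actual implementation.
# Shows how "sum", "mean", and array slicing might be expressed on a dask
# array backed by CuPy chunks; the step of 3 is the interval noted below.
import cupy
import dask.array as da

def make_operation(name):
    if name == "sum":
        return lambda x: x.sum()
    if name == "mean":
        return lambda x: x.mean()
    if name == "slice":
        return lambda x: x[::3].copy()  # take every third row, force a copy
    raise ValueError(f"unknown operation: {name}")

# Same random-array construction pattern as the benchmark uses
rs = da.random.RandomState(RandomState=cupy.random.RandomState)
x = rs.random((10_000, 10_000), chunks=2_500)
result = make_operation("mean")(x).compute()
```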
This also adds a special argument, --benchmark-json, which takes an optional path at which to dump the benchmark results in JSON format. This would allow us to generate plots from the output, as discussed in #517.

Some thoughts:

  • Should there be an additional argument to specify the array slicing interval (which is currently fixed at 3)?
  • Could the JSON output be cleaned up? Currently, a (truncated) sample output file looks like:
{
  "operation": "transpose_sum",
  "size": 10000,
  "second_size": 1000,
  "chunk_size": 2500,
  "compute_size": [
    10000,
    10000
  ],
  "compute_chunk_size": [
    2500,
    2500
  ],
  "ignore_size": "1.05 MB",
  "protocol": "tcp",
  "devs": "0,1,2,3",
  "threads_per_worker": 1,
  "times": [
    {
      "wall_clock": 1.4910394318867475,
      "npartitions": 16
    }
  ],
  "bandwidths": {
    "(00,01)": {
      "25%": "136.34 MB/s",
      "50%": "156.67 MB/s",
      "75%": "163.32 MB/s",
      "total_nbytes": "150.00 MB"
    }
  }
}
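As a rough sketch of the plotting use case (the file name and the use of matplotlib are illustrative assumptions, not part of this PR), a dumped results file could be read back like so:

```python
# Sketch: load a --benchmark-json results file and plot its wall-clock times.
# "benchmark.json" and matplotlib are assumptions for illustration only.
import json

import matplotlib.pyplot as plt

with open("benchmark.json") as fp:
    results = json.load(fp)

wall_clocks = [t["wall_clock"] for t in results["times"]]
plt.bar(range(len(wall_clocks)), wall_clocks)
plt.ylabel("wall clock (s)")
plt.title(f"{results['operation']}, size={results['size']}")
plt.savefig("benchmark.png")
```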

@charlesbluca charlesbluca requested a review from a team as a code owner February 12, 2021 01:12
@charlesbluca charlesbluca mentioned this pull request Feb 12, 2021
@codecov-io

codecov-io commented Feb 12, 2021

Codecov Report

Merging #524 (ff03dec) into branch-0.18 (32d9d33) will increase coverage by 2.02%.
The diff coverage is 96.35%.

Impacted file tree graph

@@               Coverage Diff               @@
##           branch-0.18     #524      +/-   ##
===============================================
+ Coverage        90.42%   92.45%   +2.02%     
===============================================
  Files               15       16       +1     
  Lines             1128     1550     +422     
===============================================
+ Hits              1020     1433     +413     
- Misses             108      117       +9     
| Impacted Files | Coverage Δ |
|---|---|
| dask_cuda/cli/dask_cuda_worker.py | 96.92% <ø> (ø) |
| dask_cuda/cuda_worker.py | 78.75% <75.00%> (+1.73%) ⬆️ |
| dask_cuda/device_host_file.py | 90.90% <80.00%> (-7.96%) ⬇️ |
| dask_cuda/get_device_memory_objects.py | 89.04% <89.04%> (ø) |
| dask_cuda/proxify_device_objects.py | 93.87% <93.87%> (ø) |
| dask_cuda/proxy_object.py | 91.21% <96.07%> (+3.42%) ⬆️ |
| dask_cuda/explicit_comms/dataframe/merge.py | 96.24% <96.24%> (ø) |
| dask_cuda/explicit_comms/dataframe/shuffle.py | 98.51% <98.51%> (ø) |
| dask_cuda/__init__.py | 100.00% <100.00%> (ø) |
| dask_cuda/explicit_comms/comms.py | 98.78% <100.00%> (-0.23%) ⬇️ |

... and 8 more

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f9ce83a...ff03dec.

@madsbk madsbk added 3 - Ready for Review Ready for review by team improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Feb 12, 2021
Member

@madsbk madsbk left a comment


Looks good to me, nice addition @charlesbluca.

Should there be an additional argument to specify the array slicing interval (which is currently fixed at 3)?

I think fixing it to 3 is fine for now. If we need to benchmark the impact of step sizes, we can always add it. At that point, we might consider using the first positional argument as a command and treating the following arguments as command-specific options (e.g., like how git works).
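For reference, a minimal sketch of what that git-style interface could look like with argparse subcommands; every name here is illustrative, not the benchmark's actual CLI:

```python
# Minimal sketch of a git-style CLI built on argparse subcommands; all names
# are illustrative and not the benchmark's actual interface.
import argparse

parser = argparse.ArgumentParser(prog="local_cupy_benchmark")
subparsers = parser.add_subparsers(dest="operation", required=True)

slice_parser = subparsers.add_parser("slice", help="benchmark array slicing")
slice_parser.add_argument("--step", type=int, default=3,
                          help="slicing interval (step size)")

subparsers.add_parser("sum", help="benchmark array sum")

args = parser.parse_args(["slice", "--step", "5"])
print(args.operation, args.step)  # -> slice 5
```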

Could the JSON output be cleaned up? Currently, a (truncated) sample output file looks like:

Looks fine to me :)

@@ -179,6 +197,40 @@ async def run(args):
)
print(fmt % (d1, d2, bw[0], bw[1], bw[2], total_nbytes[(d1, d2)]))

if args.benchmark_json:
import json
Member


Moving import json to the top should be fine

Member Author


Done! Also going to change this to from json import dump.
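For context, a minimal sketch of what the dump might look like after that change; the surrounding variables (the args fields other than benchmark_json, and took_list) are assumptions standing in for the benchmark's real state, not the PR's exact code:

```python
# Sketch of dumping benchmark results after moving the import to the top and
# switching to `from json import dump`; variable names are assumptions.
from json import dump

def dump_benchmark_results(args, took_list):
    if not args.benchmark_json:
        return
    bench_data = {
        "operation": args.operation,
        "size": args.size,
        "chunk_size": args.chunk_size,
        "protocol": args.protocol,
        "times": [
            {"wall_clock": took, "npartitions": npartitions}
            for took, npartitions in took_list
        ],
    }
    with open(args.benchmark_json, "w") as fp:
        dump(bench_data, fp, indent=2)
```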

x = rs.random((args.size, args.size), chunks=args.chunk_size).persist()
await wait(x)
func_args = (x,)

Member


Suggested change

Member

@pentschev pentschev left a comment


LGTM as well, thanks @charlesbluca !

@pentschev pentschev added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team labels Feb 12, 2021
@pentschev
Member

@gpucibot merge

@rapids-bot rapids-bot bot merged commit acded78 into rapidsai:branch-0.18 Feb 12, 2021