Inspecting alpaka's implementations while thinking about zero-copying as part of #1820, I wondered whether alpaka actually supports copying buffers between two GPUs on, e.g., the CUDA backend. Searching alpaka for API calls like cudaMemcpyPeer and cudaDeviceEnablePeerAccess only points me to the documentation here, which says that cudaDeviceEnablePeerAccess is "automatically done when required", but the API is never called inside the alpaka codebase. So I wondered whether CUDA just does that automatically as part of cudaMemcpy when the source and destination are on different GPUs, and whether that is a feature of CUDA that is always present or one that requires some minimum CUDA version or compute architecture. Does anyone know?
Independently, we should have tests and an example demonstrating such a scenario. This concerns all backends, not just CUDA.
By the way, cudaMemcpy and cudaMemcpyAsync cannot copy memory allocated by cudaMallocAsync across different devices. NVIDIA's recommendation is to use cudaMemcpyPeer or cudaMemcpyPeerAsync instead.
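For illustration, a minimal sketch of that recommendation; the device indices, buffer size, and stream setup are made up for the example, and error checking is omitted:

```cpp
#include <cstddef>
#include <cuda_runtime.h>

int main()
{
    std::size_t const bytes = 1u << 20;

    // Stream-ordered allocation from device 0's default memory pool.
    cudaSetDevice(0);
    cudaStream_t s0;
    cudaStreamCreate(&s0);
    void* src = nullptr;
    cudaMallocAsync(&src, bytes, s0);

    // Stream-ordered allocation from device 1's default memory pool.
    cudaSetDevice(1);
    cudaStream_t s1;
    cudaStreamCreate(&s1);
    void* dst = nullptr;
    cudaMallocAsync(&dst, bytes, s1);
    cudaStreamSynchronize(s1); // dst must exist before it is used on another stream

    // cudaMemcpy/cudaMemcpyAsync would not accept these pool allocations for a
    // cross-device copy; cudaMemcpyPeerAsync names both devices explicitly.
    cudaSetDevice(0);
    cudaMemcpyPeerAsync(dst, 1 /*dstDevice*/, src, 0 /*srcDevice*/, bytes, s0);
    cudaStreamSynchronize(s0);

    cudaFreeAsync(src, s0);
    cudaFreeAsync(dst, s1);
    cudaStreamSynchronize(s0);
    cudaStreamSynchronize(s1);

    cudaStreamDestroy(s0);
    cudaStreamDestroy(s1);
    return 0;
}
```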
I removed the explicit peer copies a while back in #1400 because cudaMemcpy* handles this automatically, so there was no need to fiddle around with peer copies anymore. It looks like I forgot to remove this from the documentation.
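For reference, a minimal sketch of the behavior that made the explicit peer copies unnecessary, assuming a 64-bit system with unified virtual addressing and at least two CUDA devices (error checking omitted):

```cpp
#include <cstddef>
#include <cuda_runtime.h>

int main()
{
    std::size_t const bytes = 1u << 20;

    float* src = nullptr;
    float* dst = nullptr;

    cudaSetDevice(0);
    cudaMalloc(&src, bytes);

    cudaSetDevice(1);
    cudaMalloc(&dst, bytes);

    // With unified virtual addressing the runtime infers the owning device of
    // each pointer, so a plain cudaMemcpy with cudaMemcpyDefault performs the
    // cross-device copy without any prior cudaDeviceEnablePeerAccess call.
    cudaMemcpy(dst, src, bytes, cudaMemcpyDefault);

    // The explicit variant names both devices and likewise needs no
    // cudaDeviceEnablePeerAccess beforehand.
    cudaMemcpyPeer(dst, 1 /*dstDevice*/, src, 0 /*srcDevice*/, bytes);

    cudaSetDevice(0);
    cudaFree(src);
    cudaSetDevice(1);
    cudaFree(dst);
    return 0;
}
```

Whether such a copy goes directly over the peer-to-peer path or is staged through host memory depends on the platform; cudaDeviceCanAccessPeer can be used to query whether direct peer access between two devices is possible.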