Paper Link

The appetite for higher and higher 3D graphics quality continues to drive GPU computing requirements. To satisfy these demands, GPU vendors are moving towards new architectures, such as MCM-GPU and multi-GPUs, that connect multiple chip modules or GPUs with high-speed links (e.g., NVLink and XGMI) to provide higher computing capability.

Unfortunately, it is not clear how to adequately parallelize the rendering pipeline to take advantage of these resources while maintaining low rendering latencies. Current implementations of Split Frame Rendering (SFR) are bottlenecked by redundant computations and sequential inter-GPU synchronization, and fail to scale as the GPU count increases.

In this paper, we propose CHOPIN, a novel SFR scheme for multi-GPU systems that exploits the parallelism available in image composition to eliminate the bottlenecks inherent to existing solutions. CHOPIN composes opaque sub-images out-of-order, and leverages the associativity of image composition to compose adjacent sub-images of transparent objects asynchronously. To mitigate load imbalance across GPUs and avoid inter-GPU network congestion, CHOPIN includes two new scheduling mechanisms: a draw-command scheduler and an image composition scheduler. Detailed cycle-level simulations on eight real-world game traces show that, in an 8-GPU system, CHOPIN offers speedups of up to 1.56× (1.25× gmean) compared to the best prior SFR implementation.

Paper Link

CHOPIN: Scalable Graphics Rendering in Multi-GPU Systems via Parallel Image Composition, in HPCA-2021 (to appear).

Multi-GPU-ATTILA

This work was implemented upon ATTILA. The original README of the ATTILA that we used can be found at here.

For multi-GPU graphics rendering, our baseline implementation is to duplicate all primitives in every GPU (i.e., similar to NVIDIA's NVLink and AMD's CrossFire). We also implemented both GPUpd and CHOPIN.

A template config for multi-GPU simulation can be found at here. All benchmarks that we used can be downloaded from here.

ParKD

In CHOPIN, we use ParKD to divide draw calls into multiple groups, and partition transparent draw calls among GPUs. The original README of the ParKD that we used can be found at here.

Before running ParKD, you should run ATTILA first with single-GPU configuration and profileForSortLast being enabled. The generated profiling results (inputVertexes.obj, blendInfo.obj, zTestInfo.obj, and zFightingInfo.obj) are inputs of ParKD. The binary output file of ParKD will then be used as an input of CHOPIN simulation in ATTILA.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
multigpu_graphics_attila		multigpu_graphics_attila
parkd		parkd
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Paper Link

Multi-GPU-ATTILA

ParKD

About

Releases

Packages

Languages

compstruct/Scalable-Multi-GPU-Rendering

Folders and files

Latest commit

History

Repository files navigation

Paper Link

Multi-GPU-ATTILA

ParKD

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages