CUDA Ray Tracing

Description

This project implements ray tracing using CUDA for computation and OpenGL for rendering. It leverages the cuda_gl_interop library provided by Nvidia for efficient data transfer between CUDA and OpenGL. The implementation includes various geometric primitives, bounding volume hierarchy (BVH) for acceleration, and depth of field effects.

⚠️ Warning: This project was intended to be a technical and fast experimentation. The code here does not reflect my full capabilities in terms of code architecture and optimization. I just wanted to spend 5 days learning as much new technical notions as possible. There are still technical points that I need to fix later also.

Implemented Features

Geometric Primitives

The project supports various geometric primitives, including:

Bezier surfaces
Squares
Spheres
Cylinders
Meshes

Bounding Volume Hierarchy (BVH)

BVH is an acceleration structure that enhances rendering performance by organizing geometric primitives within a hierarchical tree of bounding volumes. This structure allows the ray tracing algorithm to quickly eliminate large portions of the scene that do not intersect with a given ray, thereby reducing the number of intersection tests required.

Depth of Field

Depth of field (DoF) is an effect that simulates the focus properties of a camera lens. In ray tracing, DoF is achieved by:

Aperture: The aperture size determines the depth of field; a larger aperture results in a shallower depth of field.
Focal Distance: This is the distance from the camera at which objects are in perfect focus. Rays are sampled within the aperture and focused on the focal plane, creating a natural blur for objects not in focus.

Screenshots

Purple reflective sphere: First test with DOF (Depth of Field) ON and 128 samples, rendered in less than 1s	Orange Scene: Multiple balls rendered with 64 samples & DOF ON rendered in ~8s
Low Quality Purple Statue Scene: Scene with multiple objects and the Stanford Lucy Statue (34k triangles) (1 sample) (1280x720), rendered in ~10s	High Quality Purple Statue Scene: Scene with multiple objects and the Stanford Lucy Statue (34k triangles) (32 samples) (3840x2160), rendered in ~3h
Diffused Crystal Statue Scene: Scene with multiple objects and the Stanford Lucy Statue with crystal like properties mixed with white diffuse color (34k triangles) (32 samples) (1920x1080), rendered in ~1h	Final Scene: Scene with a Stanford Lucy Statue and Stanford Bunny (34k + 70k triangles) (128 samples) (1920x1080), rendered in ~2h

Getting Started

Prerequisites

CUDA-capable GPU (I used my own laptop for this project, a RTX 3070 Laptop with Ampere architecture)
CUDA Toolkit
OpenGL

Installation

Clone the repository:

git clone https://github.com/yourusername/CUDA_RayTracing.git

Open the Visual Studio solution in CUDA_RayTracing\CUDA_RayTracing.sln.
Build the solution.
Launch CUDA_RayTracing.exe.

User Interaction

Load scenes from the ImGUI GUI.
Choose between continuously generating frames or rendering a single image.
Set depth of field, aperture, and focal distance.
Take screenshots (GUI is not included in the screenshots).
Change texture size (number of pixels processed by CUDA).
Adjust viewport size (window size where the rendered image is displayed).

Performance Profiling

Use Nvidia NSight Compute for detailed performance analysis and optimization of CUDA kernels.

Key Features

Ray Tracing with CUDA and OpenGL: Efficient integration using cuda_gl_interop.
Geometric Primitives: Supports bezier surfaces, squares, spheres, cylinders, and meshes.
Bounding Volume Hierarchy (BVH): Optimizes rendering by efficiently managing spatial data.
Depth of Field: Simulates camera focus with adjustable aperture and focal distance.

Resources

Notes

Visual Studio Configuration: Ensure Properties/Cuda C/C++/Common/Generate Relocatable Device Code is enabled for external symbol linking.
GPU Architecture: For an RTX 3070 Laptop, use sm_86 architecture.
Thread and Block Configuration: Utilize a 16x16 thread block size. Calculate the number of blocks as (width + block_size - 1) / block_size and (height + block_size - 1) / block_size.
Performance Profiling: Use Nvidia NSight Compute for performance profiling.
Stack Size Management: Manage stack size with cudaDeviceSetLimit(cudaLimitStackSize, N).
Random Seed Generation: Ensure unique seeds for each device to avoid correlation in random number generation.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
CUDA_RayTracing		CUDA_RayTracing
lib		lib
.gitignore		.gitignore
README.md		README.md
test.ipynb		test.ipynb
test.py		test.py
test2.py		test2.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CUDA Ray Tracing

Description

Implemented Features

Geometric Primitives

Bounding Volume Hierarchy (BVH)

Depth of Field

Screenshots

Getting Started

Prerequisites

Installation

User Interaction

Performance Profiling

Key Features

Resources

Notes

About

Releases

Packages

Languages

kevinpruvost/CUDA_RayTracing

Folders and files

Latest commit

History

Repository files navigation

CUDA Ray Tracing

Description

Implemented Features

Geometric Primitives

Bounding Volume Hierarchy (BVH)

Depth of Field

Screenshots

Getting Started

Prerequisites

Installation

User Interaction

Performance Profiling

Key Features

Resources

Notes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages