GitHub - swag2198/Parallel-Programming: Parallelized codes for different compute operations in CUDA, OpenMP and MPI libraries.

Parallel-Programming

This repository contains codes for the assignments of High Performance Parallel Programming (CS61064) at IIT Kharagpur during Spring 2020. Here is a brief overview of what will be found inside the folders.

Name	Brief description
CUDA	NVIDIA GPU kernel implementations (CUDA C) for different compute operations like Reduction, 2D Convolution, Matrix Transpose and Dot Product. Exploits different concepts like Thread packing in blocks, Global memory access coalescing, Shared memory accesses and bank conflicts to reduce overhead in (typically) Tesla K40 or K80 GPUs.
OpenMP	Implementation (C) for performing rotation of an object (given in terms of points) about a given axis in 3D cartesian coordinates using parallelized matrix multiplication operations.
MPI	Distributed memory (MPI C) implementation for Histogram equalisation and Sobel Filtering of an input image.

The Colab Notebooks folder contains some experiments I performed to ensure the proper functioning of the kernels and the correctness of the shared memory optimisations. (basically debugging!) These notebooks contain 2D Convolution (naive and shared memory implementations), Matrix Transpose and Dot Product Reduction kernels. The notebooks also contain detailed nvprof profiling and CUDA MEMCHECK checks for the GPU codes.

It is worthwhile to note that much of the optimisations involving global memory access coalescing did not work as expected for later generation GPUs like Pascal, Maxwell and Turing. I found this stack overflow post that also tries to explain the anomaly that I encountered. Another useful SO post that explains nvprof option for bandwidth is here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parallel-Programming

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
CUDA		CUDA
Colab Notebooks		Colab Notebooks
MPI		MPI
OpenMP		OpenMP
.gitignore		.gitignore
README.md		README.md

swag2198/Parallel-Programming

Folders and files

Latest commit

History

Repository files navigation

Parallel-Programming

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages