NVIDIA Corporation

Pinned

  1. DeepLearningExamples Public

    State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

    Jupyter Notebook · 14.2k stars · 3.3k forks

  2. tensorflow Public

    An Open Source Machine Learning Framework for Everyone

    C++ · 1.1k stars · 170 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C · 15.7k stars · 1.4k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go · 1.5k stars · 203 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go · 3.2k stars · 342 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Python · 3k stars · 722 forks

Repositories

Showing 10 of 570 repositories
  • numba-cuda Public

    The CUDA target for Numba

    Python · 110 stars · BSD-2-Clause · 19 forks · 79 issues · 15 pull requests · Updated May 3, 2025
  • edk2 Public

    NVIDIA fork of tianocore/edk2

    C · 22 stars · 14 forks · 0 issues · 15 pull requests · Updated May 3, 2025
  • spark-rapids-jni Public

    RAPIDS Accelerator JNI For Apache Spark

    Cuda · 48 stars · Apache-2.0 · 70 forks · 71 issues · 13 pull requests · Updated May 3, 2025
  • Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    C++ · 324 stars · 58 forks · 230 issues (9 issues need help) · 155 pull requests · Updated May 3, 2025
  • cuda-python Public

    CUDA Python: Performance meets Productivity

    Python · 2,575 stars · 159 forks · 105 issues · 11 pull requests · Updated May 3, 2025
  • TransformerEngine Public

    A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

    Python · 2,392 stars · Apache-2.0 · 416 forks · 183 issues · 70 pull requests · Updated May 3, 2025
  • TensorRT-LLM Public

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    C++ · 10,399 stars · Apache-2.0 · 1,398 forks · 559 issues (1 issue needs help) · 218 pull requests · Updated May 3, 2025
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    C++ · 683 stars · 235 forks · 344 issues (13 issues need help) · 52 pull requests · Updated May 3, 2025
  • NeMo Public

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

    Python · 13,775 stars · Apache-2.0 · 2,804 forks · 33 issues · 125 pull requests · Updated May 3, 2025
  • DCGM Public

    NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs

    C++ · 500 stars · Apache-2.0 · 63 forks · 108 issues · 7 pull requests · Updated May 3, 2025