NVIDIA Corporation

Pinned

  1. cuopt Public

    NVIDIA cuOpt is an open-source GPU-accelerated optimization engine delivering near real-time solutions for complex decision-making challenges.

    Cuda | 316 stars | 50 forks

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook | 343 stars | 47 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C | 16k stars | 1.4k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go | 1.6k stars | 214 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go | 3.5k stars | 380 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook | 3.3k stars | 792 forks

Repositories

Showing 10 of 584 repositories
  • TensorRT-LLM Public

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate inference execution in a performant way.

    C++ | 11,120 stars | Apache-2.0 | 1,617 forks | 705 issues | 357 pull requests | Updated Jul 25, 2025
  • TransformerEngine Public

    A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

    Python | 2,577 stars | Apache-2.0 | 463 forks | 203 issues | 68 pull requests | Updated Jul 25, 2025
  • JAX-Toolbox Public

    JAX-Toolbox

    Python | 324 stars | Apache-2.0 | 61 forks | 79 issues | 37 pull requests | Updated Jul 25, 2025
  • cuEquivariance Public

    cuEquivariance is a math library: a collection of low-level primitives and tensor ops that accelerate widely used models based on equivariant neural networks, such as DiffDock, MACE, Allegro and NEQUIP.

    Python | 267 stars | 17 forks | 7 issues | 4 pull requests | Updated Jul 25, 2025
  • Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    C++ | 341 stars | 61 forks | 187 issues (11 need help) | 161 pull requests | Updated Jul 25, 2025
  • NeMo-Guardrails Public

    NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

    Python | 4,918 stars | 510 forks | 123 issues (5 need help) | 39 pull requests | Updated Jul 25, 2025
  • recsys-examples Public

    Examples for recommender systems that are easy to train and deploy on accelerated infrastructure.

    Python | 91 stars | 23 forks | 26 issues (1 needs help) | 8 pull requests | Updated Jul 25, 2025
  • NeMo Public

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech).

    Python | 15,204 stars | Apache-2.0 | 3,013 forks | 51 issues | 102 pull requests | Updated Jul 25, 2025
  • cudaqx Public

    Accelerated libraries for quantum-classical computing built on CUDA-Q.

    C++ | 49 stars | 23 forks | 21 issues (1 needs help) | 7 pull requests | Updated Jul 25, 2025
  • cuda-python Public

    CUDA Python: Performance meets Productivity

    Python | 2,859 stars | 182 forks | 144 issues | 7 pull requests | Updated Jul 25, 2025