development/cuda-modules

Showing entries 1-70 out of 70.
Wrapper substituting the deprecated runfile-based CUDA installation
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
Building blocks that make it easier to write safe and efficient CUDA C++ code
Provides minor version forward compatibility for the CUDA runtime
By downloading and using this package you accept the terms and conditions of the associated licens…
Analyzes trace files containing compilation time information generated by NVCC or NVRTC
CUDA Runtime
By downloading and using this package you accept the terms and conditions of the associated licens…
Extracts information from CUDA binary files (both standalone and those embedded in host binaries) …
C-based interface for creating profiling and tracing tools designed for CUDA applications
Decodes low-level identifiers that have been mangled by CUDA C++ into user-readable names
Pre-built applications which use CUDA
By downloading and using this package you accept the terms and conditions of the associated licens…
NVIDIA tool for debugging CUDA applications on Linux and QNX systems
Nsight Eclipse Plugins Edition
CUDA compiler driver
Extracts information from standalone cubin files and presents it in human-readable format
C-based programmatic interface for monitoring and managing various states within Data Center GPUs
Collect and view profiling data from the command-line
Prune host object files and libraries to only contain device code for the specified targets
Runtime compilation library for CUDA C++
C-based Application Programming Interface (API) for annotating events, code ranges, and resources …
Cross-platform performance profiling tool for optimizing CUDA C/C++ applications
Low-level API for heterogeneous computing that runs on CUDA-powered GPUs
API for profiling CUDA runtime
Enables the creation of sanitizing and tracing tools that target CUDA applications
GPU-accelerated library of primitives for deep neural networks
Set of high-performance libraries and tools for accelerating quantum computing simulations at both…
CUDA Templates for Linear Algebra Subroutines
By downloading and using this package you accept the terms and conditions of the associated licens…
Fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
Service which supports GPU memory export and import (NVLink P2P) and shared memory operations acro…
CUDA Basic Linear Algebra Subroutine library
High-performance, multi-process, GPU-accelerated library for distributed basic dense linear algebr…
By downloading and using this package you accept the terms and conditions of the associated licens…
Library of GPU-accelerated linear solvers with sparse matrices
High-performance FFT product CUDA library
Library to leverage GDS technology
Helper module for the cuBLASMp library that allows it to efficiently perform communications betwee…
Collection of dense and sparse direct linear solvers and Eigen solvers
High-performance, distributed-memory, GPU-accelerated library that provides tools for solving dens…
GPU-accelerated basic linear algebra subroutines for sparse matrix computations for unstructured s…
High-performance CUDA library dedicated to general matrix-matrix operations in which at least one …
GPU-accelerated tensor linear algebra library for tensor contraction, reduction, and elementwise o…
Library of primitives for image and signal processing
C++ support for interfacing with the NVIDIA Performance Primitives (NPP) library
APIs which can be used at runtime to combine multiple CUDA objects into one CUDA fat binary (fatbi…
APIs which can be used at runtime to link together GPU device code
Provides high-performance, GPU-accelerated JPEG decoding functionality for image formats commonly …
Accelerates the decoding and encoding of JPEG2000 images on NVIDIA GPUs
APIs which can be used to compile a PTX program into GPU assembly code
Parallel programming interface for NVIDIA GPUs based on OpenSHMEM
Accelerates TIFF encode/decode on NVIDIA GPUs
Interface for generating PTX code from both binary and text NVVM IR inputs
Multi-GPU and multi-node collective communication primitives for NVIDIA GPUs
Tests to check both the performance and the correctness of NVIDIA NCCL operations
Interactive profiler for CUDA and NVIDIA OptiX
System-wide performance analysis and visualization tool
High-speed data compression and decompression library optimized for NVIDIA GPUs
GPUDirect Storage kernel driver to read/write data from supported storage using cufile APIs
Part of NVIDIA Performance Libraries that provides standard Fortran 77 BLAS APIs as well as C (CBL…
Common part of NVIDIA Performance Libraries
Perform Fast Fourier Transform (FFT) calculations on ARM CPUs
Part of NVIDIA Performance Libraries that provides standard Fortran 90 LAPACK and LAPACKE APIs
Collection of efficient pseudorandom and quasirandom number generators for ARM CPUs
Provides an optimized implementation of ScaLAPACK for distributed-memory architectures
Provides a set of CPU-accelerated basic linear algebra subroutines used for handling sparse matric…
Part of NVIDIA Performance Libraries that provides tensor primitives
SDK that facilitates high-performance machine learning inference