Explore Projects

Discover 8 open source projects

Active filters (1):
Search: cuda-kernelsร—
Clear all

Showing 1-8 of 8 projects

xlite-dev/LeetCUDA

LeetCUDA is a comprehensive collection of modern CUDA learning resources, including 200+ CUDA kernels, Tensor Cores, HGEMM, and FA-2 MMA.

9.8K
Active
Cuda
ML Ops
PyTorch
#cuda#cuda-toolkit#cuda-demo

NVIDIA/cuda-samples

NVIDIA CUDA samples that demonstrate features of the CUDA Toolkit for GPU-accelerated development.

8.9K
Active
C
AI SDKs & Wrappers
CLI Tools
#cuda#gpu-acceleration#nvidia

InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving large language models (LLMs).

7.7K
Active
Python
LLM Frameworks
Inference
Python
#llm#inference#deployment

Rust-GPU/rust-cuda

Rust-CUDA is an ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

5.1K
Active
Rust
API Frameworks
CLI Tools
Rust
#cuda#gpu-programming#rust-lang

NVIDIA/cccl

NVIDIA's CUDA Core Compute Libraries for accelerated computing and GPU programming in C++

2.2K
Active
C++
AI SDKs & Wrappers
CLI Tools
#accelerated-computing#cuda#gpu-programming

chelsea0x3b/dfdx

Deep learning library for Rust with shape-checked tensors and neural networks

1.9K
Archived
Rust
LLM Frameworks
Rust
#deep-learning#neural-networks#autodiff

ELS-RD/kernl

Kernl is a library that lets you run PyTorch transformer models several times faster on GPU with a single line of code.

1.6K
Active
Jupyter Notebook
LLM Frameworks
API Frameworks
PyTorch
#cuda#transformer#triton

chelsea0x3b/cudarc

Safe Rust wrapper around the CUDA toolkit for GPU acceleration in AI/ML applications.

1.1K
Active
Rust
GPU Acceleration
CLI Tools
Rust
#cuda#gpu#rust

Stay in the loop

Get weekly updates on trending AI coding tools and projects.