Explore Projects

Discover 133 open source projects

Active filters (1):
Search: cuda

Showing 1-20 of 133 projects

vllm-project/vllm

High-throughput LLM inference engine for developers

72.1K
Active
Python
Inference
LLM Wrappers & SDKs
Hugging Face
#llm#inference#ai

karpathy/LLM101n

Course on building a Storyteller AI LLM from scratch in Python, C, and CUDA

36.4K
Archived
Tutorials & Courses
Inference
PyTorch
#llm#ai#deep learning

hashcat/hashcat

Password recovery tool with GPU acceleration

25.5K
Active
C
Penetration Testing
#password-cracking#hashcat#gpu-acceleration

sgl-project/sglang

High-performance serving framework for large language and multimodal models

24.1K
Active
Python
Inference
LLM Frameworks
Python
#llm#inference#serving

NVIDIA/nvidia-docker

Build and run Docker containers leveraging NVIDIA GPUs

17.5K
Archived
docker
#docker#gpu#nvidia-docker

NVlabs/instant-ngp

Lightning fast neural graphics primitives for real-time 3D reconstruction, rendering, and more.

17.3K
Stable
Cuda
Computer Vision
React
#3d-reconstruction#real-time-rendering#machine-learning

kaldi-asr/kaldi

An open-source toolkit for building speech recognition systems.

15.3K
Stable
Shell
Speech Recognition
#speech-recognition#speaker-identification#speaker-verification

tracel-ai/burn

Burn is a high-performance tensor library and deep learning framework for AI and scientific computing in Rust.

14.5K
Active
Rust
LLM Frameworks
#deep-learning#tensor#scientific-computing

vosen/ZLUDA

CUDA on non-NVIDIA GPUs: a Rust library for running CUDA on a variety of GPU architectures.

14.0K
Active
Rust
LLM Frameworks
Rust
#cuda#gpu#non-nvidia

isl-org/Open3D

Open3D is a modern C++ library for 3D data processing, including reconstruction, registration, and visualization.

13.4K
Active
C++
Computer Vision
React
#3d-perception#computer-graphics#mesh-processing

NVIDIA/TensorRT-LLM

TensorRT LLM provides a Python API and optimizations to efficiently run large language models on NVIDIA GPUs.

13.0K
Active
Python
LLM Frameworks
PyTorch
#cuda#llm-serving#moe

jamiepine/voicebox

Open-source voice synthesis studio powered by Qwen3-TTS

12.2K
Active
TypeScript
Voice AI & Synthesis
Whisper
#qwen3-tts#voice-ai#mlx

srush/GPU-Puzzles

A puzzle-based learning resource for developers to explore CUDA and machine learning.

12.0K
Archived
Jupyter Notebook
Computer Vision
#cuda#machine-learning#puzzles

taskflow/taskflow

A high-performance task-parallel programming system for C++ developers building concurrent and heterogeneous applications.

11.8K
Active
C++
#concurrent-programming#gpu-programming#heterogeneous-parallel-programming

numba/numba

NumPy-aware dynamic Python compiler using LLVM, enabling fast, high-performance array and numerical computing.

10.9K
Active
Python
ML Ops
NumPy
#compiler#cuda#llvm

cupy/cupy

A GPU-accelerated NumPy & SciPy library for high-performance scientific computing

10.8K
Active
Python
ML Ops
Python
#cuda#gpu#numpy

xlite-dev/LeetCUDA

LeetCUDA is a comprehensive collection of modern CUDA learning resources, including 200+ CUDA kernels, Tensor Cores, HGEMM, and FA-2 MMA.

9.8K
Active
Cuda
ML Ops
PyTorch
#cuda#cuda-toolkit#cuda-demo

rapidsai/cudf

A high-performance GPU DataFrame library for data analysis and machine learning workloads.

9.5K
Active
C++
Databases
Python
#data-analysis#data-science#gpu

Oneflow-Inc/oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

9.4K
Stable
C++
ML Ops
C++
#deep-learning#cuda#distributed

NVIDIA/cutlass

A high-performance linear algebra library for GPU-accelerated deep learning and other applications.

9.4K
Active
C++
ML Ops
API Frameworks
C++
#cuda#deep-learning#gpu
