SCUDA is a GPU-over-IP bridge that allows GPUs on remote machines to be attached to CPU-only machines.
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
A high-performance Fast Fourier Transform (FFT) library for Vulkan, CUDA, HIP, OpenCL, Level Zero, and Metal.
A curated collection of YOLO object detection projects and datasets for developers working with computer vision and AI.
ILGPU is a high-performance .NET GPU compiler that enables developers to write and run GPU programs in C#.
A large-scale LLM inference engine built in C++ with support for various AI hardware accelerators.
A fast and scalable SVM library for classification and regression tasks on GPUs and CPUs.
A C++ project for deploying large language models (LLMs) such as ChatGLM and Baichuan using the MNN framework.
A fast, efficient open-source C++ library for point cloud registration using GICP algorithms.
Ultrafast serverless GPU inference, sandboxes, and background jobs for AI-focused developers.
Kernl is a library that lets you run PyTorch transformer models several times faster on GPU with a single line of code.
An implementation of the 3D Ken Burns Effect from a single image using PyTorch.
Example code for custom CUDA operators in PyTorch.
An open-source CUDA compiler that targets AMD GPUs, compiling CUDA code to GFX11/12 machine code.
A real-time dashboard for monitoring NVIDIA GPU usage and performance metrics.
TornadoVM is a heterogeneous programming framework that enables developers to leverage GPUs and other accelerators for improved performance in Java applications.
A Python repository for experimenting with deep learning and computer vision using images of cats.
MatConvNet: A deep learning framework for MATLAB, utilizing CUDA.
A high-performance C++ library for neural machine translation, with CUDA support for GPU acceleration.
A C++ library for accelerating YOLO-based computer vision models using NVIDIA's TensorRT framework.