Explore Projects

Discover 62 open source projects

Active filters (1):
Search: gpus

Showing 21-40 of 62 projects

turboderp-org/exllamav2

A fast inference library for running large language models (LLMs) locally on modern GPUs

4.5K
Stable
Python
LLM Frameworks
CLI Tools
Python
#machine-learning #inference #llm

openxla/xla

A machine learning compiler for GPUs, CPUs, and ML accelerators, useful for building AI-powered applications.

4.0K
Active
C++
ML Ops
API Frameworks
#machine-learning #compiler #gpu

ilya-zlobintsev/LACT

A Linux-based GPU configuration and monitoring tool written in Rust for AMD and Nvidia GPUs.

4.0K
Active
Rust
CLI Tools
Background Jobs
#gpu #linux #amdgpu

acidanthera/WhateverGreen

A library of various patches necessary for certain ATI/AMD/Intel/Nvidia GPUs on macOS.

3.4K
Stable
C++
Firmware & Drivers
#gpu #drivers #patches

NVIDIA/TransformerEngine

A high-performance Transformer library for accelerating AI models on NVIDIA GPUs, including low-precision support.

3.2K
Active
Python
LLM Frameworks
Inference
PyTorch
#deep-learning #gpu #cuda

ARM-software/ComputeLibrary

A computer vision and machine learning library optimized for Arm CPUs and GPUs using SIMD technologies.

3.1K
Active
C++
Computer Vision
API Frameworks
#computer-vision #machine-learning #arm

pytorch/TensorRT

PyTorch compiler for NVIDIA GPUs using TensorRT, enabling efficient deep learning inference on CUDA hardware.

3.0K
Active
Python
ML Ops
API Frameworks
PyTorch
#deep-learning #cuda #nvidia

BBuf/how-to-optim-algorithm-in-cuda

This repository provides guidance on optimizing algorithms for CUDA, a framework for parallel computing on NVIDIA GPUs.

2.8K
Active
Cuda
LLM Frameworks
CLI Tools
CUDA
#cuda #optimization #parallel-computing

NVIDIA/gpu-operator

NVIDIA GPU Operator manages GPUs in Kubernetes for developers building AI-powered applications.

2.6K
Active
Go
ML Ops
Containerization
Kubernetes
#gpu #cuda #kubernetes

microsoft/DirectML

DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning tasks on a variety of GPUs.

2.5K
Active
C++
AI SDKs & Wrappers
API Frameworks
#machine-learning #gpu-acceleration #directx12

openlit/openlit

An open-source platform for AI engineering with LLM observability, GPU monitoring, and prompt management tools.

2.3K
Active
Python
LLM Frameworks
Monitoring
Python
#ai-observability #gpu-monitoring #llmops

NVIDIA/cutile-python

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs in Python.

2.0K
Active
Python
GPU
#gpu #kernel #parallel-kernels

XiongjieDai/GPU-Benchmarks-on-LLM-Inference

Compares the performance of multiple NVIDIA GPUs and Apple Silicon for running large language model inference.

1.9K
Archived
Jupyter Notebook
LLM Inference
Benchmarking
#benchmarking #large-language-models #gpu-performance

kevmo314/scuda

SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.

1.8K
Active
C++
GPU-Accelerated ML
Networking
#cuda #cublas #cudnn

AdaptiveCpp/AdaptiveCpp

A community-driven C++ compiler for heterogeneous programming models like SYCL, C++ parallelism, and CUDA/HIP.

1.8K
Active
C++
API Frameworks
Build Tools
#compiler #gpu-computing #high-performance-computing

nestrilabs/nestri

A cloud-based platform for deploying and streaming games/apps, with support for various gaming technologies.

1.7K
Stable
TypeScript
BaaS Platforms
Containerization
Next.js
#gaming #cloud-streaming #linux-gaming

Xtra-Computing/thundersvm

A fast and scalable SVM library for classification and regression tasks on GPUs and CPUs.

1.6K
Archived
C++
ML Ops
API Frameworks
#classification #regression #gpu

Tencent/tgfx

A high-performance 2D graphics library for modern GPUs, supporting text, image, and vector rendering across platforms.

1.5K
Active
C++
Component Libraries (React)
2D Graphics
React
#2d #graphics #rendering

Zaneham/BarraCUDA

Open-source CUDA compiler that targets AMD GPUs, compiling CUDA code to GFX11/12 machine code.

1.5K
Active
C
Build Tools
Inference
CUDA
#cuda-compiler #amd-gpu #gfx11-gfx12

NVIDIA/nccl-tests

NCCL Tests: a suite of tests that check the performance and correctness of NCCL, NVIDIA's library of multi-GPU collective communication primitives.

1.5K
Active
Cuda
AI SDKs & Wrappers
#cuda #gpu #nvidia
