Explore Projects

Discover 11 open source projects

Active filters (1):
Search: offloading

Showing 1-11 of 11 projects

bigscience-workshop/petals

A distributed system for running large language models (LLMs) collaboratively across personal devices, with fine-tuning and inference faster than offloading.

10.0K
Archived
Python
LLM Frameworks
PyTorch
#llm #distributed-computing #fine-tuning

FMInference/FlexLLMGen

A Python library for running large language models on a single GPU for high-throughput scenarios.

9.4K
Archived
Python
LLM Frameworks
#large-language-models #high-throughput #gpu-optimization

katanemo/plano

Delivers infrastructure for agentic apps with an AI-native proxy and data plane.

5.9K
Active
Rust
#proxy #gateway #LLM

cloud-hypervisor/cloud-hypervisor

A secure, Rust-based Virtual Machine Monitor for modern cloud workloads with support for Windows and Linux guests.

5.4K
Active
Rust
API Frameworks
Containerization
#virtualization #kvm #cloud-workloads

alewin/useWorker

useWorker() is a React Hook that allows you to offload blocking tasks to a web worker for a more responsive UI.

3.1K
Active
JavaScript
Component Libraries (React)
CLI Tools
React
#background-tasks #web-workers #react-hook

dvmazur/mixtral-offloading

Run Mixtral-8x7B language models on Colab or consumer desktops with offloading capabilities.

2.3K
Archived
Python
LLM Frameworks
BaaS Platforms
PyTorch
#language-model #offloading #quantization

pshihn/workly

A simple library to move a function or class to a web worker, enabling developers to offload CPU-intensive tasks.

1.9K
Archived
JavaScript
#web-worker #thread #javascript

NVIDIA/aistore

AIStore: A scalable, high-performance, and high-availability storage solution for AI applications and workloads.

1.8K
Active
Go
API Frameworks
Databases
Go
#distributed-storage #object-storage #s3-compatible

developit/stockroom

A state management library that offloads store management to a web worker for improved performance.

1.8K
Archived
JavaScript
State Management
CLI Tools
React
#state-management #web-worker #performance

apache/incubator-gluten

Gluten is a Scala library that offloads JVM-based SQL engines' execution to native engines for improved performance.

1.5K
Active
Scala
API Frameworks
Databases
Scala
#spark-sql #clickhouse #simd

cdnbye/hlsjs-p2p-engine

A peer-to-peer CDN engine for HLS-based video streaming that uses WebRTC to offload traffic from the server.

1.1K
Active
Component Libraries (React)
API Frameworks
React
#cdn #p2p-cdn #p2p-video-streaming
