A distributed system for running large language models (LLMs) on personal devices, enabling faster fine-tuning and inference.
A Python library for running large language models on a single GPU in high-throughput scenarios.
Delivers infrastructure for agentic apps with an AI-native proxy and data plane.
A secure, Rust-based Virtual Machine Monitor for modern cloud workloads with support for Windows and Linux guests.
useWorker() is a React Hook that allows you to offload blocking tasks to a web worker for a more responsive UI.
Run Mixtral-8x7B language models on Colab or consumer desktops with offloading capabilities.
A simple library to move a function or class to a web worker, enabling developers to offload CPU-intensive tasks.
AIStore: A scalable, high-performance, and high-availability storage solution for AI applications and workloads.
A state management library that offloads store management to a web worker for improved performance.
Gluten is a Scala library that offloads JVM-based SQL engines' execution to native engines for improved performance.
A peer-to-peer CDN engine for HLS-based video streaming that uses WebRTC to offload traffic from the server.