Explore Projects

Discover 173 open source projects

Active filters (1):
Search: serve×
Clear all

Showing 41-60 of 173 projects

knative/serving

Knative Serving is a Kubernetes-based, scale-to-zero, request-driven compute platform for building and deploying serverless applications.

6.0K
Active
Go
Serverless
Containerization
Kubernetes
#serverless#autoscaling#containers

gothinkster/react-redux-realworld-example-app

An exemplary real-world application built with React and Redux, serving as a learning resource for developers.

5.6K
Archived
JavaScript
Component Libraries (React)
Frontend Frameworks
React
#react#redux#example-project

lightdash/lightdash

Lightdash is a self-serve BI tool that helps data teams 10x their productivity with data visualization and analytics.

5.6K
Active
TypeScript
Data Analytics
ORMs & Query Builders
TypeScript
#business-intelligence#data-analytics#data-visualization

OpenBMB/ToolBench

An open platform for training, serving, and evaluating large language models for tool learning.

5.5K
Experimental
Python
LLM Frameworks
Agents & Orchestration
Python
#large-language-models#tool-learning#open-source

volcano-sh/volcano

A Cloud Native Batch System for running AI/ML workloads on Kubernetes at scale.

5.4K
Active
Go
ML Ops
API Frameworks
Kubernetes
#ai#batch-processing#kubernetes

superduper-io/superduper

Superduper is an end-to-end framework for building custom AI applications and agents using Python, PyTorch, and Transformers.

5.3K
Stable
Python
LLM Frameworks
Agents & Orchestration
PyTorch
#ai#chatbot#mlops

brianfrankcooper/YCSB

YCSB is a popular open-source load testing framework for cloud serving systems written in Java.

5.2K
Stable
Java
API Frameworks
Testing
Java
#load-testing#benchmarking#cloud-services

flashinfer-ai/flashinfer

A Python library for serving large language models (LLMs) with high performance, including GPU acceleration and distributed inference.

5.1K
Active
Python
LLM Frameworks
Inference
PyTorch
#llm#inference#cuda

microsoft/SPTAG

A high-quality, distributed vector search library for large-scale AI and machine learning applications.

5.0K
Active
C++
Vector Search
API Frameworks
#approximate-nearest-neighbor-search#distributed-serving#space-partition-tree

kvcache-ai/Mooncake

Mooncake is a serving platform for Kimi, a leading LLM service provided by Moonshot AI, focused on disaggregation, inference, and RDMA.

4.9K
Active
C++
LLM Frameworks
API Frameworks
C++
#llm#inference#rdma

tencentmusic/cube-studio

An open-source cloud-native AI platform for ML/DL workflows, model serving, and distributed training.

4.9K
Stable
Python
MLOps
BaaS Platforms
PyTorch
#ai-platform#mlops#model-serving

SylphAI-Inc/LLM-engineer-handbook

A curated list of Large Language Model resources for training, serving, fine-tuning, and building LLM applications.

4.7K
Stable
LLM Frameworks
LLM Wrappers & SDKs
Node
#large-language-models#llm-development#llm-training

SeldonIO/seldon-core

An MLOps framework for packaging, deploying, monitoring, and managing machine learning models.

4.7K
Active
Go
Kubernetes
#machine-learning#mlops#production-machine-learning

SPLWare/esProc

esProc SPL is a JVM-based programming language for structured data computation, serving as both a data analysis tool and an embedded computing engine.

4.7K
Active
Java
Databases
Dataset
#cluster-computing#sql#database

lm-sys/RouteLLM

Framework for routing LLM requests to optimize costs while maintaining response quality

4.7K
Archived
Python
LLM Frameworks
AI Model Serving
Python
#llm-router#cost-optimization#inference-framework

gpustack/gpustack

Optimize AI inference performance on GPUs with this Python library for selecting and tuning inference engines.

4.6K
Active
Python
Inference
CLI Tools
Python
#ai-inference#gpu-acceleration#performance-optimization

ahkarami/Deep-Learning-in-Production

A repository sharing notes and references on deploying deep learning models in production.

4.4K
Archived
React
#deep-learning#production#model-serving

imazen/imageflow

High-performance image manipulation library for web servers, supporting image compression, processing, and serving.

4.4K
Active
Rust
API Frameworks
Backend Frameworks
#image-compression#image-manipulation#image-server

pytorch/serve

Serves, optimizes, and scales PyTorch models for production use

4.4K
Stable
Java
PyTorch
#serving#optimization#scaling

ericniebler/range-v3

A C++14/17/20 range library that serves as the basis for C++20's std::ranges

4.4K
Experimental
C++
CLI Tools
API Frameworks
#c++#iterator#proposal
124...9

Stay in the loop

Get weekly updates on trending AI coding tools and projects.