Explore Projects

Discover 31 open source projects

Active filters (1):
Search: vllm

Showing 21-31 of 31 projects

OpenDCAI/DataFlow

LLM-based operators and pipelines for data preparation

2.9K
Active
Python
AI Coding Tools
Gradio
#data-science #data-agent #data-cleaning

containers/ramalama

RamaLama simplifies local serving of AI models and enables their use for inference in production via containers.

2.6K
Active
Python
LLM Frameworks
Inference
Python
#ai #containers #inference-server

mostlygeek/llama-swap

Reliable model swapping for local LLM servers - seamlessly switch between llama.cpp, vLLM, and compatible backends

2.6K
Active
Go
Local Inference Engines
LLM Wrappers & SDKs
llama.cpp
#local-llm #model-swapping #llama-cpp

NVIDIA/Model-Optimizer

A Python library for optimizing deep learning models for faster inference on deployment platforms like TensorRT.

2.1K
Active
Python
Inference
CLI Tools
#deep-learning #model-optimization #quantization

apconw/Aix-DB

A LangChain-based framework for end-to-end natural language to data insight conversion, with MCP Skills multi-agent architecture.

2.0K
Active
JavaScript
LLM Frameworks
MCP Frameworks
LangChain
#llm #langchain #mcp

vllm-project/vllm-ascend

A community-maintained hardware plugin for running large language models (LLMs) on Ascend accelerators.

1.7K
Active
C++
LLM Frameworks
Inference
#ascend #llm-serving #llmops

bricks-cloud/BricksLLM

Enterprise-grade API gateway for monitoring and managing costs/rates across LLMs like OpenAI, Anthropic, and Azure OpenAI.

1.2K
Archived
Go
API Clients & Testing
API Documentation
Golang
#ai #llm #openai

kubeai-project/kubeai

An AI inference operator for Kubernetes that makes it easy to serve ML models in production.

1.2K
Active
Go
Inference
BaaS Platforms
#ai #kubernetes #inference

Alpha-VLLM/Lumina-mGPT-2.0

Lumina-mGPT 2.0 is a stand-alone autoregressive image generation model.

1.1K
Stable
Python
Computer Vision
LLM Frameworks
Python
#computer-vision #autoregressive-modeling #image-generation

Ksuriuri/index-tts-vllm

Adds vLLM support to IndexTTS, enabling faster text-to-speech inference.

1.1K
Stable
Python
LLM Frameworks
Inference
Python
#text-to-speech #llm #inference

prometheus-eval/prometheus-eval

A Python library for evaluating the responses of large language models using Prometheus evaluator models and GPT-4.

1.1K
Experimental
Python
LLM Frameworks
Testing
Python
#llm #gpt4 #evaluation