LLM-based operators and pipelines for data preparation.
RamaLama simplifies local serving of AI models and enables their use for inference in production via containers.
Reliable model swapping for local LLM servers - seamlessly switch between llama.cpp, vLLM, and compatible backends
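The core idea behind such a swapper is that only one backend holds the GPU at a time, so switching means unloading the current one before loading the next. A toy sketch of that pattern (the class and method names here are hypothetical illustrations, not the project's actual API):

```python
class BackendRegistry:
    """Toy model-swapping registry: one backend is active at a time."""

    def __init__(self):
        self._active = None  # name of the currently loaded backend, if any

    def swap_to(self, name: str) -> str:
        """Activate `name`, unloading the previous backend first."""
        if self._active == name:
            return f"{name} already loaded"
        previous, self._active = self._active, name
        if previous:
            return f"unloaded {previous}, loaded {name}"
        return f"loaded {name}"


registry = BackendRegistry()
print(registry.swap_to("llama.cpp"))  # loaded llama.cpp
print(registry.swap_to("vllm"))       # unloaded llama.cpp, loaded vllm
```

A real implementation would additionally wait for in-flight requests to drain and verify the new backend is healthy before routing traffic to it.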
A Python library for optimizing deep learning models for faster inference on runtimes such as TensorRT.
A LangChain-based framework for end-to-end natural language to data insight conversion, with MCP Skills multi-agent architecture.
A community-maintained hardware plugin for running large language models (LLMs) on Ascend accelerators.
Enterprise-grade API gateway for monitoring and managing costs/rates across LLMs like OpenAI, Anthropic, and Azure OpenAI.
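Cost tracking in such a gateway boils down to pricing each request from its token counts. A minimal sketch of that calculation (the per-1K-token prices below are hypothetical placeholders for illustration; a real gateway loads current pricing from each provider):

```python
# Hypothetical per-1K-token prices, for illustration only.
PRICES_PER_1K = {
    "gpt-4o": {"input": 0.005, "output": 0.015},
    "claude-3-5-sonnet": {"input": 0.003, "output": 0.015},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the dollar cost of one request from its token usage."""
    p = PRICES_PER_1K[model]
    return (input_tokens / 1000) * p["input"] + (output_tokens / 1000) * p["output"]


print(request_cost("gpt-4o", 1000, 500))  # 0.0125 under the placeholder prices
```

Rate limiting then works the same way, with a per-key running total compared against a budget instead of a price sheet.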
An AI inference operator for Kubernetes that makes it easy to serve ML models in production.
Lumina-mGPT 2.0 is a stand-alone autoregressive image modeling tool implemented in Python.
Adds vLLM backend support to IndexTTS, enabling faster AI-powered text-to-speech inference.
A Python library to evaluate the responses of large language models such as GPT-4 using Prometheus metrics.
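Prometheus scrapes metrics in a simple text exposition format, which is what an evaluation library like this ultimately emits. A small hypothetical helper showing that format (the metric name and labels are made up for illustration; they are not the project's actual metrics):

```python
def to_prometheus(name: str, help_text: str, samples) -> str:
    """Render counter samples in the Prometheus text exposition format."""
    lines = [f"# HELP {name} {help_text}", f"# TYPE {name} counter"]
    for labels, value in samples:
        label_str = ",".join(f'{k}="{v}"' for k, v in sorted(labels.items()))
        lines.append(f"{name}{{{label_str}}} {value}")
    return "\n".join(lines)


# Hypothetical metric: count of evaluated LLM responses, labeled by model.
print(to_prometheus(
    "llm_eval_total",
    "Evaluated LLM responses",
    [({"model": "gpt-4"}, 3)],
))
```

In practice one would use the official `prometheus_client` package rather than formatting the text by hand; the sketch above only shows what the scraped output looks like.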