Reliable model swapping for local LLM servers - seamlessly switch between llama.cpp, vLLM, and compatible backends
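The one-line description captures the core mechanism: a single proxy endpoint stays up while the heavyweight backend process (llama.cpp's llama-server, vLLM, etc.) is stopped and restarted on demand so only one model occupies GPU memory at a time. Below is a minimal Python sketch of that swap-on-demand idea, not the project's actual implementation; the model names, launch commands, port, and readiness wait are all illustrative assumptions.

```python
# Hypothetical sketch of swap-on-demand model serving: keep one backend
# process alive at a time, and restart it when a request names a different
# model. The config shape and commands below are illustrative only.
import json
import subprocess
import time
import urllib.request

# Illustrative mapping from model name to backend launch command.
BACKENDS = {
    "qwen2.5-7b": ["llama-server", "-m", "qwen2.5-7b.gguf", "--port", "9001"],
    "llama3-8b":  ["vllm", "serve", "meta-llama/Meta-Llama-3-8B", "--port", "9001"],
}

current = {"model": None, "proc": None}

def ensure_backend(model: str) -> None:
    """Stop the running backend (if any) and start the one serving `model`."""
    if current["model"] == model:
        return  # requested model already loaded, nothing to swap
    if current["proc"] is not None:
        current["proc"].terminate()  # free VRAM before loading the next model
        current["proc"].wait()
    current["proc"] = subprocess.Popen(BACKENDS[model])
    current["model"] = model
    time.sleep(5)  # crude readiness wait; a real proxy polls a health endpoint

def completion(model: str, prompt: str) -> str:
    """Route an OpenAI-style completion request to the backend serving `model`."""
    ensure_backend(model)
    body = json.dumps({"model": model, "prompt": prompt}).encode()
    req = urllib.request.Request(
        "http://127.0.0.1:9001/v1/completions",
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["text"]
```

A production proxy would additionally handle readiness probes, request streaming, and concurrent clients; the point of the sketch is only the stop-then-start control flow that lets one port serve many models.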
Stars: 2.6K
Forks: 191
Created: Oct 4, 2024
Last updated: Mar 2, 2026
Growth: +6.4% change in stars per day
Engagement: normal (≈5 stars/day averaged over 519 days)
Related projects:
- Run LLMs locally in C/C++ with high performance
- Run local LLMs on any device with GPT4All
- Web UI for local AI with multiple backends and offline capabilities
- Run frontier AI models locally across devices using RDMA and tensor parallelism