Showing 41-60 of 199 projects
Open-source stack for industrial-grade LLM applications, including LLM gateway, observability, optimization, evaluation, and experimentation.
A self-hosted, offline, ChatGPT-like chatbot powered by Llama 2 with no data leaving your device.
An open-source Java library that simplifies integrating LLMs into Java apps through a unified API.
A framework for testing and evaluating large language models, prompts, and AI agents for security and performance.
A Python library for efficient RAG (Retrieval-Augmented Generation) applications with an AI-powered vector database and private storage.
Python bindings for the llama.cpp library, enabling developers to use LLMs in their Python projects.
A distributed system for running large language models (LLMs) on personal devices, enabling faster fine-tuning and inference.
Unified, production-ready inference API to run open-source, speech, and multimodal models on cloud, on-prem, or your laptop.
Easily fine-tune, evaluate, and deploy open-source large language models like GPT-OSS and Llama.
An open-source project to pre-train a 1.1B Llama model on 3 trillion tokens for AI coding tools and agents.
Train multi-step agents for real-world tasks using GRPO, supporting models like Qwen2.5, Qwen3, and Llama.
High-performance C++ library for fast local deployment of large language models (LLMs) like LLaMA.
An accelerator for local LLM inference and fine-tuning on Intel XPUs, with seamless integration into popular LLM frameworks.
A private & local AI personal knowledge management app for high-entropy people, with a focus on vibe coders.
Open-source Chinese language model BELLE for building AI-powered chatbots and conversational applications.
Composable building blocks for LLM apps using Python.
Semantic cache for large language models, fully integrated with LangChain and llama_index.
A comprehensive SDK for running frontier LLMs and VLMs on multiple hardware platforms with day-0 model support.
LMDeploy is a toolkit for compressing, deploying, and serving large language models (LLMs).
A comprehensive collection of resources for large language models, including AI coding tools, MCP frameworks, and more.