Showing 1-9 of 9 projects
Run open-source AI models locally with Ollama, supporting multiple frameworks and integrations.
High-throughput LLM inference engine for developers
Fine-tuning & RL for LLMs with optimized performance and memory use
High-performance serving framework for large language and multimodal models
OpenAI's open-weight language models for powerful reasoning and agentic tasks.
A Python library for efficient RAG (Retrieval-Augmented Generation) applications with AI-powered vector database and private storage.
Easily fine-tune, evaluate and deploy open-source large language models like GPT-OSS and Llama.
A comprehensive SDK for running frontier LLMs and VLMs on multiple hardware platforms with day-0 model support.
A native macOS app that allows users to chat with a local LLM without installing other software.
Get weekly updates on trending AI coding tools and projects.