High-performance serving framework for large language and multimodal models
Minimal PyTorch GPT re-implementation for training and inference
Comprehensive LLM engineering and application resources with training, inference, compression, and deployment guides
High-performance mobile-optimized neural network inference framework for deploying AI models on mobile devices
Faster Whisper transcription with CTranslate2 for efficient speech-to-text
Multilingual voice generation model with full-stack capabilities for TTS, training, and deployment
A single-file C implementation of Llama 2 for efficient large language model inference
Inference code and example notebooks for the Meta Segment Anything Model 2 (SAM 2).
A comprehensive guide to building with the LLaMA language model, covering inference, fine-tuning, and end-to-end solutions.
High-performance in-browser LLM inference engine for building AI-powered web applications and tools.
An open-source machine learning engineering reference with resources for training, deploying, and scaling AI models.
A flexible framework for optimizing heterogeneous LLM inference and fine-tuning workflows.
Inference code for Code Llama, Meta's family of code-specialized Llama models for AI-powered coding tools and workflows.
A comprehensive machine learning library in Python with implementations of various algorithms and models.
A static analyzer for Java, C, C++, and Objective-C written in OCaml.
A powerful pattern matching library for TypeScript with smart type inference to simplify control flow.
A comprehensive collection of free LLM inference resources accessible via API for AI developers.
AirLLM enables 70B-parameter model inference on a single 4GB GPU.
A collection of high-performance large language models (LLMs) with recipes to pretrain, finetune, and deploy at scale.
TensorRT-LLM provides a Python API and optimizations for running large language models efficiently on NVIDIA GPUs.