Explore Projects

Discover 321 open source projects

Active filters (1):
Search: inferenceร—
Clear all

Showing 21-40 of 321 projects

sgl-project/sglang

High-performance serving framework for large language and multimodal models

24.1K
Active
Python
Inference
LLM Frameworks
Python
#llm#inference#serving

karpathy/minGPT

Minimal PyTorch GPT re-implementation for training and inference

23.8K
Archived
Python
LLM Frameworks
Example Projects
PyTorch
#gpt#pytorch#llm

liguodongiot/llm-action

Comprehensive LLM engineering and application resources with training, inference, compression, and deployment guides

23.4K
Stable
HTML
Fine-tuning
Inference
#llm-training#llm-inference#llm-ops

Tencent/ncnn

High-performance mobile-optimized neural network inference framework for deploying AI models on mobile devices

22.9K
Active
C++
Inference
Cross-Platform
#deep-learning#mobile-ai#ios

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2 for efficient speech-to-text

21.3K
Stable
Python
Inference
Local Inference Engines
CTranslate2
#speech-to-text#inference#quantization

FunAudioLLM/CosyVoice

Multilingual voice generation model with full-stack capabilities for TTS, training, and deployment

19.8K
Active
Python
AI Voice & Speech
Fine-tuning
PyTorch
#tts#voice-generation#multilingual

karpathy/llama2.c

A single-file C implementation of Llama 2 for efficient large language model inference

19.2K
Archived
C
LLM Frameworks
#large-language-model#llama#inference

facebookresearch/sam2

A repository for running inference with the Meta Segment Anything Model 2 (SAM 2) and example notebooks.

18.6K
Archived
Jupyter Notebook
Computer Vision
Jupyter Notebook
#computer-vision#image-segmentation#machine-learning

meta-llama/llama-cookbook

This is a comprehensive guide for building with the LLaMA language model, covering inference, fine-tuning, and end-to-end solutions.

18.2K
Stable
Jupyter Notebook
LLM Frameworks
PyTorch
#llama#language-models#fine-tuning

mlc-ai/web-llm

High-performance in-browser LLM inference engine for building AI-powered web applications and tools.

17.5K
Active
TypeScript
LLM Frameworks
React
#chatgpt#language-model#llm

stas00/ml-engineering

An open-source machine learning engineering reference with resources for training, deploying, and scaling AI models.

17.3K
Active
Python
LLM Frameworks
PyTorch
#machine-learning#deployment#scalability

kvcache-ai/ktransformers

A flexible framework for optimizing heterogeneous LLM inference and fine-tuning workflows.

16.7K
Active
Python
LLM Frameworks
React
#llm#inference#fine-tuning

meta-llama/codellama

Inference code for CodeLlama models, a developer platform focused on AI-powered coding tools and workflows.

16.3K
Archived
Python
AI Code Generation
Python
#llm#code-generation#ai-coding

ddbourgin/numpy-ml

A comprehensive machine learning library in Python with implementations of various algorithms and models.

16.3K
Archived
Python
ML Ops
#machine-learning#numpy#deep-learning

facebook/infer

A static analyzer for Java, C, C++, and Objective-C written in OCaml.

15.5K
Active
OCaml
OCaml
#static-analysis#code-quality#compiler

gvergnaud/ts-pattern

A powerful pattern matching library for TypeScript with smart type inference to simplify control flow.

14.8K
Active
TypeScript
CLI Tools
TypeScript
#branching#conditions#exhaustive

cheahjs/free-llm-api-resources

A comprehensive collection of free LLM inference resources accessible via API for AI developers.

14.1K
Active
Python
LLM Frameworks
API Clients & Testing
#llm#ai#api

lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

13.5K
Stable
Jupyter Notebook
React
#open-source-models#generative-ai#inference

Lightning-AI/litgpt

A collection of high-performance large language models (LLMs) with recipes to pretrain, finetune, and deploy at scale.

13.2K
Active
Python
LLM Frameworks
Python
#ai#artificial-intelligence#large-language-models

NVIDIA/TensorRT-LLM

TensorRT LLM provides a Python API and optimizations to efficiently run large language models on NVIDIA GPUs.

13.0K
Active
Python
LLM Frameworks
PyTorch
#cuda#llm-serving#moe
13...17

Stay in the loop

Get weekly updates on trending AI coding tools and projects.