Inference

Explore 351 open source projects in Inference

Showing 1-20 of 351 projects

tensorflow/tensorflow

TensorFlow is an open-source machine learning framework for building and deploying ML models.

194.0K
Active
C++
ML Ops
Inference
Python
#tensorflow #machine-learning #deep-learning

huggingface/transformers

Model framework for state-of-the-art ML models in text, vision, audio, and multimodal tasks.

157.4K
Active
Python
LLM Frameworks
Agents & Orchestration
PyTorch
#transformers #huggingface #deep-learning

deepseek-ai/DeepSeek-V3

DeepSeek-V3 is a large-scale MoE language model with 671B parameters, optimized for efficiency and performance.

101.9K
Stable
Python
LLM Frameworks
Inference
DeepSeekMoE
#large-language-model #moe-architecture #deep-learning

pytorch/pytorch

PyTorch is a Python library for tensor computation and deep learning with GPU acceleration.

98.0K
Active
Python
Inference
ML Ops
Python
#pytorch #deep-learning #gpu

tensorflow/models

TensorFlow Model Garden with SOTA implementations

77.7K
Active
Python
Computer Vision
ML Ops
TensorFlow
#tensorflow #models #machine-learning

d2l-ai/d2l-zh

Deep learning teaching resources, including runnable code and a discussion forum.

76.0K
Archived
Python
Books & Guides
Inference
Jupyter Notebook
#deep-learning #machine-learning #computer-vision

vllm-project/vllm

High-throughput LLM inference engine for developers

72.1K
Active
Python
Inference
LLM Wrappers & SDKs
Hugging Face
#llm #inference #ai

keras-team/keras

Keras 3 is a multi-backend deep learning framework for building and training models with support for JAX, TensorFlow, PyTorch, and OpenVINO.

63.9K
Active
Python
Computer Vision
ML Ops
TensorFlow
#deep-learning #machine-learning #neural-networks

meta-llama/llama

Llama 2 inference code for running Llama models

59.2K
Archived
Python
Inference
Local Inference Engines
#llama2 #inference #ai-models

xai-org/grok-1

Open-source Grok-1 model for local inference with JAX

51.5K
Archived
Python
Inference
Local Inference Engines
JAX
#grok-1 #llm #inference

JuliaLang/julia

The Julia programming language for technical computing.

48.5K
Active
Julia
Full-Stack Frameworks
Documentation
Julia
#julia #programming-language #scientific-computing

freqtrade/freqtrade

Crypto trading bot with backtesting and machine learning

47.4K
Active
Python
Crypto Tools
Inference
Python
#crypto-trading #backtesting #machine-learning

ggml-org/whisper.cpp

High-performance C/C++ port of OpenAI's Whisper for speech recognition

47.2K
Active
C++
Inference
CLI Tools
#speech-to-text #c++ #inference

karpathy/nanochat

Train LLMs on a single GPU for minimal cost

44.5K
Active
Python
Fine-tuning
Inference
PyTorch
#llm-training #gpu-optimization #cost-effective-ml

aymericdamien/TensorFlow-Examples

TensorFlow tutorial and examples for beginners

43.8K
Archived
Jupyter Notebook
Inference
Tutorials & Courses
TensorFlow
#tensorflow #machine-learning #deep-learning

exo-explore/exo

Run frontier AI models locally across devices using RDMA and tensor parallelism

42.1K
Active
Python
Desktop Model Runners
Inference
MLX
#ai-inference #rdma #tensor-parallelism

deepspeedai/DeepSpeed

DeepSpeed optimizes deep learning training and inference with distributed computing techniques.

41.7K
Active
Python
ML Ops
Inference
PyTorch
#deep-learning #distributed-training #inference-optimization

hpcaitech/ColossalAI

Colossal-AI optimizes large AI model training and inference with distributed computing and GPU acceleration.

41.4K
Active
Python
ML Ops
Inference
PyTorch
#ai-optimization #distributed-training #gpu-acceleration

lm-sys/FastChat

Open platform for training, serving, and evaluating LLM chatbots with Vicuna and Chatbot Arena

39.4K
Experimental
Python
LLM Frameworks
Inference
Python
#llm #vicuna #chatbot

microsoft/qlib

AI-powered quantitative investment platform for finance and trading

38.2K
Active
Python
Inference
SaaS Boilerplates
Python
#quantitative-investment #algorithmic-trading #machine-learning