Explore Projects

Discover 10 open source projects

Active filters (1):

Search: post-training×

Clear all

Showing 1-10 of 10 projects

meta-pytorch/torchtune

torchtune is a PyTorch-native post-training library for fine-tuning and tuning machine learning models.

5.7K

Active

Python

Fine-tuning

Inference

PyTorch

#machine-learning#fine-tuning#post-training

THUDM/slime

An LLM post-training framework for scaling Reinforcement Learning.

4.6K

Active

Python

LLM Frameworks

API Frameworks

Python

#reinforcement-learning#llm#post-training

allenai/open-instruct

AllenAI's open-source post-training codebase for building AI models and agents.

3.6K

Active

Python

LLM Frameworks

Agents & Orchestration

Python

#llm#language-model#ai-agent

hao-ai-lab/FastVideo

A unified inference and post-training framework for accelerated video generation powered by AI.

3.1K

Active

Python

Computer Vision

Inference

PyTorch

#video-generation#diffusion-models#distillation

thinking-machines-lab/tinker-cookbook

A Python library for post-training with the Tinker AI coding tool, focused on vibe coders.

2.9K

Active

Python

AI Code Generation

LLM Wrappers & SDKs

Python

#ai-coding#code-generation#llm-sdk

SmartFlowAI/EmoLLM

This is a large language model (LLM) focused on mental health, with pre/post-training, datasets, evaluation, and deployment tools.

1.7K

Stable

Python

LLM Frameworks

Fine-tuning

#llm#mental-health#dataset

bespokelabsai/curator

A Python library for synthetic data curation and structured data extraction for machine learning models.

1.6K

Active

Python

Synthetic Data

LLM Frameworks

Python

#machine-learning#data-generation#data-curation

mit-han-lab/smoothquant

SmoothQuant is an efficient post-training quantization tool for large language models, enabling accurate and fast inference.

1.6K

Archived

Python

LLM Frameworks

Inference

Python

#quantization#large-language-models#performance-optimization

meta-pytorch/OpenEnv

An interface library for reinforcement learning (RL) post-training with environments.

1.2K

Active

Python

Agents & Orchestration

CLI Tools

Python

#reinforcement-learning#rl#environments

JudgmentLabs/judgeval

An open-source post-building layer for AI agents, providing environment data and evaluations to power agent post-training and monitoring.

1.0K

Active

Python

Agents & Orchestration

LLM Frameworks

Python

#agent#agentic-ai#llm-evaluation

Stay in the loop

Get weekly updates on trending AI coding tools and projects.