Showing 1-14 of 14 projects
A Python library for using and fine-tuning over 900 large language models and multimodal models for various AI tasks.
Train multi-step agents for real-world tasks using GRPO, with support for models such as Qwen2.5, Qwen3, and Llama.
A Python-based library for solving visual understanding tasks using vision-language models (VLMs) trained with reinforcement learning.
This repository allows developers to train their own medical language models using a ChatGPT-style training pipeline.
Comprehensive open-source library of AI research and engineering skills for any AI model.
An advanced multimodal AI model series for vision-language reasoning, developed by Skywork AI.
A comprehensive guide and resources for developing AI agents and working with large language models (LLMs).
An open-source implementation of a Flow Matching Model for training AI agents via online reinforcement learning.
Implements the GRPO algorithm from scratch for developers interested in reinforcement learning and AI agents.
verl-agent is a Python framework for training LLM/VLM agents using reinforcement learning.
A simple Python implementation of a GRPO-like algorithm for reproducing R1-like LLM reasoning.
An official implementation of DanceGRPO, which applies GRPO techniques to visual generation.
MixGRPO is a Python library that unlocks flow-based GRPO efficiency with mixed ODE-SDE for diffusion and reinforcement learning.
An open-source post-building layer for AI agents, providing environment data and evaluations to power agent post-training and monitoring.