Explore Projects

Discover 14 open source projects

Active filters (1):

Search: grpo×

Clear all

Showing 1-14 of 14 projects

modelscope/ms-swift

A Python library for using and fine-tuning over 900 large language models and multimodal models for various AI tasks.

12.9K

Active

Python

LLM Frameworks

Python

#llm#multimodal#fine-tuning

OpenPipe/ART

Train multi-step agents for real-world tasks using GRPO, supporting AI tools like Qwen2.5, Qwen3, and Llama.

8.9K

Active

Python

Agents & Orchestration

Reinforcement Learning

Python

#agent#reinforcement-learning#llms

om-ai-lab/VLM-R1

A Python-based library for solving visual understanding tasks using reinforced visual-linguistic models (VLMs).

5.9K

Stable

Python

LLM Frameworks

Computer Vision

Python

#deepseek-r1#multimodal#reinforcement-learning

shibing624/MedicalGPT

This repository allows developers to train their own medical language models using the ChatGPT training pipeline.

4.8K

Active

Python

LLM Frameworks

Fine-tuning

Python

#chatgpt#gpt#llama

Orchestra-Research/AI-research-SKILLs

Comprehensive open-source library of AI research and engineering skills for any AI model.

4.4K

Active

TeX

LLM Frameworks

Agents & Orchestration

#ai#machine-learning#claude

SkyworkAI/Skywork-R1V

An advanced multimodal AI model series for vision-language reasoning, developed by Skywork AI.

3.2K

Stable

Python

LLM Frameworks

Agents & Orchestration

Python

#multimodal#vision-language#reasoning

adongwanai/AgentGuide

A comprehensive guide and resources for developing AI agents and working with large language models (LLMs).

2.1K

Active

HTML

LLM Frameworks

Agents & Orchestration

React

#ai-agent#langchain#llm

yifan123/flow_grpo

An open-source implementation of a Flow Matching Model for training AI agents via online reinforcement learning.

2.0K

Stable

Python

LLM Frameworks

Agents & Orchestration

Python

#ai-agents#reinforcement-learning#flow-matching

policy-gradient/GRPO-Zero

Implements the GRPO algorithm from scratch for developers interested in reinforcement learning and AI agents.

1.8K

Experimental

Python

Agents & Orchestration

CLI Tools

Python

#reinforcement-learning#ai-agents#grpo

langfengQ/verl-agent

verl-agent is a Python framework for training LLM/VLM agents using reinforcement learning.

1.6K

Active

Python

LLM Frameworks

Agents & Orchestration

Python

#large-language-models#llm-training#reinforcement-learning

lsdefine/simple_GRPO

A simple Python implementation of a GRPO-like LLM for reproducing r1-like thinking.

1.6K

Stable

Python

LLM Frameworks

CLI Tools

#llm#language-model#cli

XueZeyue/DanceGRPO

An official implementation of DanceGRPO, a visual generation model that leverages GRPO techniques.

1.5K

Stable

Python

Computer Vision

Inference

Python

#computer-vision#generative-ai#visual-generation

Tencent-Hunyuan/MixGRPO

MixGRPO is a Python library that unlocks flow-based GRPO efficiency with mixed ODE-SDE for diffusion and reinforcement learning.

1.1K

Stable

Python

Agents & Orchestration

Reinforcement-Learning

#diffusion#grpo#reinforcement-learning

JudgmentLabs/judgeval

An open-source post-building layer for AI agents, providing environment data and evaluations to power agent post-training and monitoring.

1.0K

Active

Python

Agents & Orchestration

LLM Frameworks

Python

#agent#agentic-ai#llm-evaluation

Stay in the loop

Get weekly updates on trending AI coding tools and projects.