Explore Projects

Discover 14 open source projects

Active filters (1):
Search: grpoร—
Clear all

Showing 1-14 of 14 projects

modelscope/ms-swift

A Python library for using and fine-tuning over 900 large language models and multimodal models for various AI tasks.

12.9K
Active
Python
LLM Frameworks
Python
#llm#multimodal#fine-tuning

OpenPipe/ART

Train multi-step agents for real-world tasks using GRPO, supporting AI tools like Qwen2.5, Qwen3, and Llama.

8.9K
Active
Python
Agents & Orchestration
Reinforcement Learning
Python
#agent#reinforcement-learning#llms

om-ai-lab/VLM-R1

A Python-based library for solving visual understanding tasks using reinforced visual-linguistic models (VLMs).

5.9K
Stable
Python
LLM Frameworks
Computer Vision
Python
#deepseek-r1#multimodal#reinforcement-learning

shibing624/MedicalGPT

This repository allows developers to train their own medical language models using the ChatGPT training pipeline.

4.8K
Active
Python
LLM Frameworks
Fine-tuning
Python
#chatgpt#gpt#llama

Orchestra-Research/AI-research-SKILLs

Comprehensive open-source library of AI research and engineering skills for any AI model.

4.4K
Active
TeX
LLM Frameworks
Agents & Orchestration
#ai#machine-learning#claude

SkyworkAI/Skywork-R1V

An advanced multimodal AI model series for vision-language reasoning, developed by Skywork AI.

3.2K
Stable
Python
LLM Frameworks
Agents & Orchestration
Python
#multimodal#vision-language#reasoning

adongwanai/AgentGuide

A comprehensive guide and resources for developing AI agents and working with large language models (LLMs).

2.1K
Active
HTML
LLM Frameworks
Agents & Orchestration
React
#ai-agent#langchain#llm

yifan123/flow_grpo

An open-source implementation of a Flow Matching Model for training AI agents via online reinforcement learning.

2.0K
Stable
Python
LLM Frameworks
Agents & Orchestration
Python
#ai-agents#reinforcement-learning#flow-matching

policy-gradient/GRPO-Zero

Implements the GRPO algorithm from scratch for developers interested in reinforcement learning and AI agents.

1.8K
Experimental
Python
Agents & Orchestration
CLI Tools
Python
#reinforcement-learning#ai-agents#grpo

langfengQ/verl-agent

verl-agent is a Python framework for training LLM/VLM agents using reinforcement learning.

1.6K
Active
Python
LLM Frameworks
Agents & Orchestration
Python
#large-language-models#llm-training#reinforcement-learning

lsdefine/simple_GRPO

A simple Python implementation of a GRPO-like LLM for reproducing r1-like thinking.

1.6K
Stable
Python
LLM Frameworks
CLI Tools
#llm#language-model#cli

XueZeyue/DanceGRPO

An official implementation of DanceGRPO, a visual generation model that leverages GRPO techniques.

1.5K
Stable
Python
Computer Vision
Inference
Python
#computer-vision#generative-ai#visual-generation

Tencent-Hunyuan/MixGRPO

MixGRPO is a Python library that unlocks flow-based GRPO efficiency with mixed ODE-SDE for diffusion and reinforcement learning.

1.1K
Stable
Python
Agents & Orchestration
Reinforcement-Learning
#diffusion#grpo#reinforcement-learning

JudgmentLabs/judgeval

An open-source post-building layer for AI agents, providing environment data and evaluations to power agent post-training and monitoring.

1.0K
Active
Python
Agents & Orchestration
LLM Frameworks
Python
#agent#agentic-ai#llm-evaluation

Stay in the loop

Get weekly updates on trending AI coding tools and projects.