Showing 1-20 of 104 projects
Fine-tuning & RL for LLMs with optimized performance and memory use
AI-powered quantitative investment platform for finance and trading
A Chinese community for learning and building with the Llama large language model, with open-source and commercial-ready resources.
A comprehensive Chinese tutorial on reinforcement learning, with implementations of popular RL algorithms.
A research framework for fast prototyping of reinforcement learning algorithms.
This repository contains tutorials, assignments, and competitions for MIT's deep learning courses, covering a wide range of AI and machine learning topics.
An elegant PyTorch deep reinforcement learning library for vibe coders building AI-powered applications.
A curated list of reinforcement learning resources for developers interested in this field of AI.
An open-source, scalable, and high-performance RL framework for building AI-powered applications and tools.
Train multi-step agents for real-world tasks using GRPO, supporting AI tools like Qwen2.5, Qwen3, and Llama.
A course in reinforcement learning for developers interested in building AI-powered applications.
General-purpose sandbox platform providing multi-language SDKs and Docker/K8s runtimes for AI agents.
A deep reinforcement learning library for the Keras deep learning framework, enabling AI-powered applications.
This is a repository for the Udacity Deep Reinforcement Learning Nanodegree program, focusing on deep reinforcement learning algorithms and techniques.
This repo contains a course on Deep Reinforcement Learning from Hugging Face.
EasyR1 is an efficient and scalable multi-modality reinforcement learning training framework based on veRL.
An LLM post-training framework for scaling Reinforcement Learning.
Hands-on guide for reinforcement learning, a key technique in AI and ML-powered coding tools.
Massively parallel deep reinforcement learning library for efficient, stable, and lightweight RL models.
An efficient, scalable RL training framework for reasoning and search engine calling interleaved with large language models.
Get weekly updates on trending AI coding tools and projects.