Showing 1-7 of 7 projects
DouZero is a deep reinforcement learning framework for mastering the Chinese card game DouDizhu.
KataGo is an open-source Go engine and self-play learning platform for AI research and development.
A clean implementation of the AlphaZero algorithm for playing various games like Othello, Gomoku, and TicTacToe.
Comprehensive reinforcement learning framework for building AI agents and distributed systems
A powerful benchmark for Monte Carlo Tree Search in sequential decision-making scenarios.
An open-source AI platform for training and deploying AI agents for StarCraft II, with distributed training and grandmaster-level performance.
The official implementation of Self-Play Fine-Tuning (SPIN), a deep learning technique for fine-tuning large language models.
Get weekly updates on trending AI coding tools and projects.