A curated collection of LLM papers, blogs, and projects, focused on OpenAI o1 and reasoning techniques.
A clean implementation of the AlphaZero algorithm for playing various games like Othello, Gomoku, and TicTacToe.
An implementation of the AlphaZero algorithm for the classic board game Gomoku (Five in a Row).
An implementation of the MuZero reinforcement learning algorithm for general-purpose use cases.
A powerful benchmark for Monte Carlo Tree Search in sequential decision-making scenarios.
A comprehensive repository showcasing the latest advances in System-2 reasoning and LLM-based models.
Mulberry is a multimodal LLM (MLLM) with o1-like reasoning and reflection, implemented via Collective MCTS.