Explore Projects

Discover 7 open-source projects

Active filters (1):
Search: mcts

Showing 1-7 of 7 projects

hijkzzz/Awesome-LLM-Strawberry

A curated collection of LLM papers, blogs, and projects, focused on OpenAI o1 and reasoning techniques.

6.9K
Stable
LLM Frameworks
Tutorials & Courses
#llm #openai-o1 #reasoning

suragnair/alpha-zero-general

A clean implementation of the AlphaZero algorithm for playing various games like Othello, Gomoku, and TicTacToe.

4.4K
Archived
Jupyter Notebook
Reinforcement Learning
Tutorials & Courses
PyTorch
#alphazero #reinforcement-learning #game-ai

junxiaosong/AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for the classic board game Gomoku (Five in a Row).

3.6K
Archived
Python
Reinforcement Learning
Board Games
PyTorch
#alphago #mcts #monte-carlo-tree-search

werner-duvaud/muzero-general

An implementation of the MuZero reinforcement learning algorithm for general-purpose use cases.

2.8K
Archived
Python
Agents & Orchestration
CLI Tools
PyTorch
#reinforcement-learning #monte-carlo-tree-search #deep-learning

opendilab/LightZero

A powerful benchmark for Monte Carlo Tree Search in sequential decision-making scenarios.

1.5K
Active
Python
Reinforcement Learning
Testing
PyTorch
#mcts #reinforcement-learning #monte-carlo-tree-search

zzli2022/Awesome-System2-Reasoning-LLM

Comprehensive repository showcasing the latest advances in System-2 reasoning and LLM-based AI models.

1.3K
Experimental
Python
LLM Frameworks
Agents & Orchestration
#llm #reasoning #system-2

HJYao00/Mulberry

Mulberry is an o1-like reasoning and reflection MLLM implemented via Collective MCTS.

1.2K
Active
Python
LLM Frameworks
AI Code Generation
#ai-coding #llm #mcts
