Found 5,349 repositories(showing 30)
Universal MCT wrapper script for all Windows 10/11 versions from 1507 to 21H2!
hijkzzz
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
suragnair
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
junxiaosong
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
werner-duvaud
MuZero
opendilab
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
zzli2022
Latest Advances on System-2 Reasoning
HJYao00
[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
yaotingwangofficial
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
THUDM
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
facebookresearch
The release codes of LA-MCTS with its application to Neural Architecture Search.
Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"
s-casci
Easily train AlphaZero-like agents on any environment you want!
SonySemiconductorSolutions
Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.
chncwang
A Go A.I. based on MCTS WITHOUT Neural Networks
hrpan
MCTS project for Tetris
YuxiXie
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
haroldsultan
Python Implementations of Monte Carlo Tree Search
1989Ryan
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
QueensGambit
A Deep Learning UCI-Chess Variant Engine written in C++ & Python :parrot:
dylandjian
A student implementation of Alpha Go Zero
pasky
Minimalistic Go MCTS Engine
vgarciasc
Visualization of MCTS algorithm applied to Tic-tac-toe.
DataCanvasIO
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
JARVIS-Xs
SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expanding the search space and escaping local optima. On SWE-bench Verified, it achieves SOTA performance
pbsinclair42
A simple package to allow users to run Monte Carlo Tree Search on any perfect information domain
nyoki-mtl
No description available
sungyubkim
A pytorch tutorial for DRL(Deep Reinforcement Learning)
initial-h
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
Coff0xc
Enterprise AI Red Team Platform | 企业级AI红队平台 | 132 MCP Tools | Pure Python Engines | SDK+CLI+MCP | Auto-Download sqlmap/nuclei/ffuf | Production C2 | LLM Enhanced | Docker Sandbox | SARIF CI/CD | 1980 Tests