Search Results

Found 5,349 repositories (showing 30)

MediaCreationTool.bat

AveYo

💚95

Universal MCT wrapper script for all Windows 10/11 versions from 1507 to 21H2!

10.2k

3.1k

MIT

Batchfile

Updated 14 hours ago

iso, tpm-bypass, windows, +2

Awesome-LLM-Strawberry

hijkzzz

💛81

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6.9k

368

Apache-2.0

Updated 10 hours ago

chain-of-thought, coding, llm, +5

alpha-zero-general

suragnair

💛86

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

4.4k

1.2k

MIT

Jupyter Notebook

Updated 1 day ago

alpha-zero, alphago, alphago-zero, +14

AlphaZero_Gomoku

junxiaosong

💛84

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

3.6k

1.0k

MIT

Python

Updated 1 day ago

alphago, alphago-zero, alphazero, +10

muzero-general

werner-duvaud

💛80

MuZero

2.8k

673

MIT

Python

Updated 2 days ago

alphago, alphazero, deep-learning, +16

LightZero

opendilab

💛73

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

1.6k

188

Apache-2.0

Python

Updated 4 hours ago

alpha-beta-pruning, alphazero, atari, +17

Awesome-System2-Reasoning-LLM

zzli2022

🧡67

Latest Advances on System-2 Reasoning

1.3k

Python

Updated 7 hours ago

benchmark, macro-action, mcts, +9

Mulberry

HJYao00

🧡67

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

1.2k

113

Python

Updated 1 day ago

Awesome-MCoT

yaotingwangofficial

🧡61

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

977

Updated 18 hours ago

chain-of-thought, cot, deepseek-r1, +12

ReST-MCTS

THUDM

🧡66

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

701

Python

Updated 1 day ago

LaMCTS

facebookresearch

🧡56

The release codes of LA-MCTS with its application to Neural Architecture Search.

481

NOASSERTION

Python

Updated 3 weeks ago

My_Bibliography_for_Research_on_Autonomous_Driving

chauvinSimon

🧡66

Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"

467

101

Updated 3 days ago

behavioral-cloning, belief-planning, bibliography, +17

tinyzero

s-casci

🧡61

Easily train AlphaZero-like agents on any environment you want!

435

MIT

Python

Updated 2 weeks ago

alphazero, mcts, reinforcement-learning

mct-model-optimization

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.

434

Apache-2.0

Python

Updated 2 days ago

deep-learning, deep-neural-networks, edge-ai, +10

FoolGo

chncwang

🧡52

A Go A.I. based on MCTS WITHOUT Neural Networks

379

195

MIT

C++

Updated 2 months ago

the-game-of-go

tetris_mcts

hrpan

❤️36

MCTS project for Tetris

348

Python

Updated 3 months ago

deep-learning, game, mcts, +3

MCTS-DPO

YuxiXie

❤️46

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

329

Apache-2.0

Jupyter Notebook

Updated 2 months ago

MCTS

haroldsultan

🧡66

Python Implementations of Monte Carlo Tree Search

324

Python

Updated 21 hours ago

llm-mcts

1989Ryan

🧡56

[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.

300

Apache-2.0

Python

Updated 2 weeks ago

large-language-models, neurips-2023, task-planning

CrazyAra

QueensGambit

🧡56

A Deep Learning UCI-Chess Variant Engine written in C++ & Python 🦜

285

GPL-3.0

Jupyter Notebook

Updated 2 weeks ago

alphago, alphazero, artificial-intelligence, +13

SuperGo

dylandjian

❤️36

A student implementation of Alpha Go Zero

284

Python

Updated 3 months ago

alphago, alphago-zero, machine-learning, +4

michi

pasky

❤️46

Minimalistic Go MCTS Engine

276

Python

Updated 1 month ago

mcts-viz

vgarciasc

🧡55

Visualization of MCTS algorithm applied to Tic-tac-toe.

271

JavaScript

Updated 1 week ago

mcts, p5js, tictactoe, +1

Hypernets

DataCanvasIO

💛71

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

264

Apache-2.0

Python

Updated 3 days ago

autodl, automl, enas, +10

SE-Agent

JARVIS-Xs

💛71

SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expanding the search space and escaping local optima. On SWE-bench Verified, it achieves SOTA performance

246

MIT

Python

Updated 2 days ago

claude-code, code-agent, code-fix, +5

MCTS

pbsinclair42

🧡61

A simple package to allow users to run Monte Carlo Tree Search on any perfect information domain

238

MIT

Python

Updated 1 week ago

bert-mcts-youtube

nyoki-mtl

❤️25

No description available

227

Python

Updated 6 months ago

Deep_RL_with_pytorch

sungyubkim

❤️46

A pytorch tutorial for DRL(Deep Reinforcement Learning)

225

Jupyter Notebook

Updated 1 month ago

a2c, c51, counterfactual-regret-minimization, +13

AlphaZero_Gomoku_MPI

initial-h

❤️46

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

219

Python

Updated 1 month ago

algorithm, alphago, alphazero, +10

AutoRedTeam-Orchestrator

Coff0xc

🧡66

206

NOASSERTION

Python

Updated 1 hour ago

active-directory, ai-powered, automation, +17
