GitHub Explorer

by Alexey Ratnikov

GitHub Explorer

GitHub Explorer|TRENDING COMPARE|FEEDBACK

Back to search

Copyright (c) 2026 Alexey Ratnikov

ellanorai/ThinkRL - GitHub Explorer | GitHub Explorer | Trending | Compare

ThinkRL

ellanorai•PUBLIC

ThinkRL is a comprehensive, state-of-the-art reinforcement learning from human feedback (RLHF) library designed to democratize advanced AI training. Built with a zero-dependency core philosophy, ThinkRL provides researchers and developers with cutting-edge algorithms, reasoning capabilities, and multimodal support in a single, unified platform.

deeplearningpytorchreinforcement-learningtransformers-models

Other

Created on Jun 15, 2025

Updated on Mar 21, 2026

Stars

7

Forks

1

Watchers

7

Open Issues

0

Repository Health Score

🧡

60/100

Fair

Overall repository health assessment

Score Breakdown

Activity

Regular updates - updated this month

20/30

67%

Community

7 stars, 1 forks

0/30

0%

Documentation

Has description, wiki, license

20/20

100%

Maintenance

0.0% issue ratio

20/20

100%

Health score is calculated based on activity, community engagement, documentation quality, and maintenance practices

Languages

Python

99.7%

Makefile

0.3%

Dependencies

No package.json found

This might not be a Node.js project

Top Contributors

1

Archit03

User

212

commits

2

claude

User

25

commits

3

ellanorai

User

14

commits

4

godspeed-003

User

9

commits

5

shashwat051102

User

3

commits

Recent Commits

Merge pull request #62 from ellanorai/star

ellanorai•2 weeks ago

3121e04View on GitHub

test: Add CLI tests for the `star` command to verify argument parsing and parameter passing.

archit03•2 weeks ago

7068cdeView on GitHub

feat: Introduce STaR, GRPO, and PRIME algorithms with associated training infrastructure, datasets, and CLIs, replacing `run_eval.py`.

archit03•2 weeks ago

7b01d77View on GitHub

Potential fix for code scanning alert no. 178: Unused import

godspeed•3 weeks ago

7c1eae7View on GitHub

Potential fix for code scanning alert no. 177: Unused import

godspeed•3 weeks ago

4fb11edView on GitHub

Potential fix for code scanning alert no. 181: Unused local variable

godspeed•3 weeks ago

d91061aView on GitHub

Potential fix for code scanning alert no. 180: Unused local variable

godspeed•3 weeks ago

47cb108View on GitHub

Potential fix for code scanning alert no. 176: Unused import

godspeed•3 weeks ago

191baf6View on GitHub

Potential fix for code scanning alert no. 175: Unused import

godspeed•3 weeks ago

df0f00dView on GitHub

Potential fix for code scanning alert no. 179: Unused import

godspeed•3 weeks ago

83be029View on GitHub

feat: Implement STaR (Self-Taught Reasoner) algorithm, trainer, and CLI

godspeed•3 weeks ago

aa8389aView on GitHub

Merge pull request #61 from ellanorai/dev

ellanorai•3 weeks ago

d054e5fView on GitHub

feat: Add a new CLI command for Group Relative Policy Optimization (GRPO) training.

archit03•3 weeks ago

fbe3cd9View on GitHub

Merge pull request #60 from Archit/dev

ellanorai•3 weeks ago

1b94cfeView on GitHub

feat: Add ThinkRL CLI with initial commands for training, generation, merging, info, SFT, and GRPO.

archit03•3 weeks ago

654bc88View on GitHub

View all commits