Search Results

Found 16 repositories(showing 16)

minrl

10-OASIS-01

🧡60

MinRL provides clean, minimal implementations of fundamental reinforcement learning algorithms in a customizable GridWorld environment. The project focuses on educational clarity and implementation simplicity while maintaining production-quality code standards.

122

MIT

Python

Updated 1 week ago

deepreinforcementlearninggridworld-environmentqlearning+1

minRLHF

thomfoster

🧡65

A (somewhat) minimal library for finetuning language models with PPO on human feedback.

Python

Updated 4 days ago

minrlm

avilum

🧡65

Stop forcing LLMs to answer in one pass. Give them a runtime. Recursive Language Model that improves any LLM, while reducing token usage up to 4X.

MIT

Python

Updated 2 days ago

agentai-agentscost-optimization+7

minRLHF

ttumiel

🧡60

Minimal RLHF implementation built on top of minGPT.

MIT

Python

Updated 2 weeks ago

minrl

anxkhn

❤️40

serverless url shortener built on top of cloudflare stack

GPL-3.0

JavaScript

Updated 3 months ago

minrl

janhuenermann

❤️35

Educational implementation of RL for LLMs in Pytorch

MIT

Python

Updated 10 months ago

Search Results

minrl

minRLHF

minrlm

minRLHF

minrl

minrl

minrl

minrlz

minrle_rust

minRLHF

minRL

minrlify

minRL

CZ4042-minRLHF

MinrLog

Landsat_8-MinrL-ExpLRN

minrl

minRLHF

minrlm

minRLHF

minrl

minrl

minrl

minrlz

minrle_rust

minRLHF

minRL

minrlify

minRL

CZ4042-minRLHF

MinrLog

Landsat_8-MinrL-ExpLRN