Found 16 repositories (showing 16)
10-OASIS-01
MinRL provides clean, minimal implementations of fundamental reinforcement learning algorithms in a customizable GridWorld environment. The project focuses on educational clarity and implementation simplicity while maintaining production-quality code standards.
thomfoster
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
avilum
Stop forcing LLMs to answer in one pass. Give them a runtime. Recursive Language Model that improves any LLM, while reducing token usage up to 4X.
ttumiel
Minimal RLHF implementation built on top of minGPT.
anxkhn
Serverless URL shortener built on top of the Cloudflare stack
janhuenermann
Educational implementation of RL for LLMs in PyTorch
albertwujj
RL algorithms, minimal
srpsco
Automatically exported from code.google.com/p/minrlz
dev0x13
Task for a JetBrains Internship 2017 application: RLE compression library port from C to Rust
lxuechen
A minimal PyTorch re-implementation of RLHF
brianfitzgerald
No description available
rupeshp-dev
No description available
timothygao8710
No description available
NhatHoang2002
No description available
c-dric
Sensor Logger
ayushr012
No description available