GitHub Explorer

by Alexey Ratnikov

GitHub Explorer

GitHub Explorer|TRENDING COMPARE|FEEDBACK

Back to search

tokenbender/avataRL - GitHub Explorer | GitHub Explorer | Trending | Compare

Back to search

avataRL

tokenbender•PUBLIC

View on GitHub

rl from zero pretrain, can it be done? yes.

Created on May 30, 2025

Updated on Mar 27, 2026

Stars

291

Forks

Watchers

291

Open Issues

Repository Health Score

🧡

56/100

Fair

Overall repository health assessment

Score Breakdown

Activity

Regular updates - updated this month

20/30

67%

Issues Analytics

Total Issues

All time

Open

4% of total

Closed

Recent Commits

Clean up docs/ and issues/ directories from main branch

tokenbender•7 months ago

d291d9bView on GitHub

Merge pull request #43 from tokenbender/full-vocab-softmax-gradient-flow

tokenbender•7 months ago

2ce1595View on GitHub

config: adjust training parameters for full vocab experiments

tokenbender•7 months ago

042c667View on GitHub

feat: compute softmax over full vocabulary for complete gradient flow

tokenbender•7 months ago

fb40379View on GitHub

refactor: move hardcoded values to config for better flexibility

tokenbender•7 months ago

8cf8549View on GitHub

add optimizations with vectorized operations, action space based probs calculation, removal of entropy calculation over entite vocab

User•7 months ago

9d9df27View on GitHub

Merge branch 'optimize-avatarl-vectorization'

User•7 months ago

b6f89eeView on GitHub

add basic timing bash file and corresponding small iteration config

User•7 months ago

c46c34eView on GitHub

Update training configuration and fix evaluation timing

User•7 months ago

2f1238eView on GitHub

fix critic download path reading from config file in start.sh

tokenbender•7 months ago

227791fView on GitHub

Merge pull request #41 from tokenbender/feat/4bit-distributed-critic

tokenbender•7 months ago

f894a60View on GitHub

feat: add 4-bit quantization support for critic model in AvataRL

tokenbender•7 months ago

0f92f2eView on GitHub

feat: add 4-bit quantization support to sample.py

tokenbender•7 months ago

c9571e6View on GitHub

perf: optimize checkpoint loading to reduce memory usage

tokenbender•7 months ago

9d01979View on GitHub

feat: add 4-bit quantized critic loading for memory optimization

tokenbender•7 months ago

602455aView on GitHub

View all commits

GitHub Explorer

avataRL

Score Breakdown

Issues Activity: Last 6 months

Top Labels

Hottest Issues