mcar18/rl-reasoning-optimizer - GitHub Explorer | GitHub Explorer | Trending

Stars

0

Forks

0

Watchers

0

Open Issues

0

Repository Health Score

❤️

45/100

Poor

Overall repository health assessment

Activity

Slow updates - updated within 3 months

10/30

33%

Recent Commits

push plots to github

mcar18•1 month ago

final results real ollama run

mcar18•1 month ago

training on full set for reinforced and looking at final results

mcar18•1 month ago

test real ollama with 30 episodes

mcar18•1 month ago

plots use stub backend by default, (no API key)

mcar18•1 month ago

baseline with 30 episodes to test run

mcar18•1 month ago