🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
Stars: 646
Forks: 63
Watchers: 646
Open Issues: 7
Commit counts: 53, 2, 2, 2, 2, 1, 1, 1, 1, 1
8697066 feat: migrate to vllm AsyncLLMEngine with async generation support (#77)
1b52eed chore: minor updates on logging and resource allocation (#73)
bc30eaf fix: incorrect state indexing in PPOMultiTurnLearner critic training (#67)
f9adda7 fix: truncated importance sampling to handle precision mismatch (#62)
7174a37 fix: use semantic version comparison for vLLM version check (#60)