Back to search
Core Idea Train a reinforcement learning agent that improves reasoning prompts for an LLM. Instead of fine-tuning the LLM directly, agent learns to optimize the reasoning process.
Stars
0
Forks
0
Watchers
0
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
6
commits