Back to search
[TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
Stars
148
Forks
1
Watchers
148
Open Issues
3
Overall repository health assessment
No package.json found
This might not be a Node.js project