Back to search
This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et al. https://arxiv.org/abs/2303.03751
Stars
195
Forks
21
Watchers
195
Open Issues
1
Overall repository health assessment
No package.json found
This might not be a Node.js project