TZW1998
This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles" by Tang et al.: https://arxiv.org/abs/2303.03751