Found 28,477 repositories (showing 30)
thedevelobear
A package containing a few micro-interactions you can use to reward your users for little things and make them smile!
zai-org
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
RLHFlow
Recipes to train reward models for RLHF.
lucidrains
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
charlesbel
A simple bot, written in Python, that uses Selenium to farm Microsoft Rewards
CodeGoat24
Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex
TheNetsky
Automated Microsoft Rewards script, using TypeScript, Cheerio and Playwright.
allenai
RewardBench: the first evaluation tool for reward models.
spin311
Automatic Microsoft Rewards Google extension and phone app
fluencelabs
No description available
zai-org
[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
avisingh599
[RSS 2019] End-to-End Robotic Reinforcement Learning without Reward Engineering
Yuphee
Gift-sending and combo-effect animations modeled on Douyu live streaming
ruixin31
No description available
bytedance
No description available
JaydenLyh
[CVPR 2026 Highlight] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
AnthonyZJiang
A Chrome extension for Microsoft Rewards search, for accounts with two-factor authentication.
jack-mil
A script to automate daily Bing rewards points
yfzhang114
✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
Prem-ium
🤖 Automate Bing Searches 🔍, Quizzes 🧪, Polls 📝, & more for Bing Rewards. 💸
SwordPuffin
Bring color to Adwaita
j-min
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
RLHFlow
Recipes to train the self-rewarding reasoning LLMs.
huaisha1224
An automated solution for earning daily Microsoft Rewards points using Python and Selenium; Tampermonkey userscript
minaek
No description available
xlang-ai
[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
rewarped
Reinforcement learning in differentiable multiphysics simulation with NVIDIA Warp.
matda59
Create Chores and Rewards for your family
silencio-transparency
No description available
chiihero
Microsoft Rewards script, localized for Chinese-language environments, based on other authors' scripts