Cornell-AGI
's Collections
REBEL: Reinforcement Learning via Regressing Relative Reward
updated
REBEL: Reinforcement Learning via Regressing Relative Rewards
Paper
•
2404.16767
•
Published
•
2
Cornell-AGI/REBEL-Llama-3-Armo-iter_1
Updated
•
4
•
1
Cornell-AGI/REBEL-Llama-3-Armo-iter_2
Updated
•
5
•
2
Cornell-AGI/REBEL-Llama-3-Armo-iter_3
Updated
•
5
•
2
Cornell-AGI/Ultrafeedback-Llama-3-Armo-iter_1
Viewer
•
Updated
•
56.1k
•
30
Cornell-AGI/Ultrafeedback-Llama-3-Armo-iter_2
Viewer
•
Updated
•
55.1k
•
30
Cornell-AGI/Ultrafeedback-Llama-3-Armo-iter_3
Viewer
•
Updated
•
44.6k
•
34
•
1
Cornell-AGI/REBEL-Llama-3
Text Generation
•
Updated
•
17
•
1
Cornell-AGI/REBEL-Llama-3-epoch_2
Text Generation
•
Updated
•
23
•
3
Cornell-AGI/REBEL-OpenChat-3.5
Text Generation
•
Updated
•
16
•
1