Zhaolin Gao
GitBag
2 followers · 0 following
https://zhaolingao.github.io/
AI & ML interests
Reinforcement Learning from Human Feedback
Recent Activity
updated a dataset 2 days ago: GitBag/llama3-uf-dp-from1735956551-same-turn
updated a dataset 2 days ago: GitBag/llama3-uf-dp-from1735956551-reinforce
updated a model 2 days ago: GitBag/reasoning_rebel_uf_dp_1k1k_from1735956551_oa_eta_1e5_lr_3e-7_mosaic_1736771778
Articles
RLHF 101: A Technical Dive into RLHF • Dec 11, 2024 • 4
Organizations
GitBag's activity
liked 3 models 4 months ago
Cornell-AGI/REBEL-Llama-3-Armo-iter_1 • Updated Sep 2, 2024 • 4 • 1
Cornell-AGI/REBEL-Llama-3-Armo-iter_2 • Updated Sep 2, 2024 • 5 • 2
Cornell-AGI/REBEL-Llama-3-Armo-iter_3 • Updated Sep 2, 2024 • 5 • 2
liked 3 models 7 months ago
Cornell-AGI/REBEL-Llama-3-epoch_2 • Text Generation • Updated Sep 1, 2024 • 23 • 3
Cornell-AGI/REBEL-OpenChat-3.5 • Text Generation • Updated Sep 1, 2024 • 16 • 1
Cornell-AGI/REBEL-Llama-3 • Text Generation • Updated Sep 1, 2024 • 17 • 1