Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
86
27
322
Nathan Lambert
natolambert
Follow
dark-pen's profile picture
nihitdesai's profile picture
Scotto2025's profile picture
138 followers
·
5 following
https://www.natolambert.com/
natolambert
natolambert
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
liked
a model
about 6 hours ago
ibm-granite/granite-3.1-2b-instruct
updated
a collection
about 6 hours ago
2025 Artifacts
liked
a model
about 16 hours ago
MiniMaxAI/MiniMax-Text-01
View all activity
Articles
Ethics and Society Newsletter #4: Bias in Text-to-Image Models
Jun 26, 2023
•
2
Can foundation models label data like humans?
Jun 12, 2023
•
1
Creating a Coding Assistant with StarCoder
May 9, 2023
•
1
StackLLaMA: A hands-on guide to train LLaMA with RLHF
Apr 5, 2023
•
23
Red-Teaming Large Language Models
Feb 24, 2023
•
18
What Makes a Dialog Agent Useful?
Jan 24, 2023
•
1
Illustrating Reinforcement Learning from Human Feedback (RLHF)
Dec 9, 2022
•
129
Stable Diffusion with 🧨 Diffusers
Aug 22, 2022
•
43
Organizations
natolambert
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
allenai/reward-bench
5 days ago
multilingual
2
#8 opened 11 days ago by
ehartford
New activity in
allenai/reward-bench
about 1 month ago
add more contaminated models to the list
2
#7 opened 3 months ago by
arielgera
New activity in
allenai/Llama-3.1-Tulu-3-70B
about 1 month ago
Reason behind not using special tokens in the prompt format?
2
#2 opened about 2 months ago by
Doctor-Shotgun
New activity in
allenai/OLMo-2-1124-13B-Instruct-preview
about 1 month ago
What is that instruction template?
1
#1 opened about 2 months ago by
SerialKicked
New activity in
allenai/Llama-3.1-Tulu-3-70B
about 1 month ago
Why do you use pass@10 to test coding perfmance...
1
#4 opened about 2 months ago by
Leon-Leee
New activity in
allenai/OLMo-2-1124-13B-Instruct-preview
about 1 month ago
Has the data set been expanded?
1
#2 opened about 2 months ago by
win10
New activity in
allenai/tulu-3-sft-personas-algebra
about 1 month ago
Librarian Bot: Add language metadata for dataset
#1 opened about 2 months ago by
librarian-bot
New activity in
allenai/tulu-3-sft-personas-math
about 1 month ago
Add link to Tulu 3 paper
#2 opened about 2 months ago by
gabrielmbmb
New activity in
allenai/llama-3.1-tulu-3-70b-preference-mixture
about 1 month ago
Librarian Bot: Add language metadata for dataset
#1 opened about 2 months ago by
librarian-bot
New activity in
allenai/llama-3.1-tulu-3-8b-preference-mixture
about 1 month ago
Easy way to separate permissive samples
1
#1 opened about 2 months ago by
RASMUS
New activity in
allenai/tulu-3-sft-mixture
about 1 month ago
recommend filter
1
#2 opened about 2 months ago by
ehartford
NuminaMath-TIR License (Apache 2, not CC-BY-NC-4.0)
1
#3 opened about 2 months ago by
rbattle
New activity in
allenai/Llama-3.1-Tulu-3-8B-RM
about 1 month ago
Adding `safetensors` variant of this model
#2 opened about 2 months ago by
SFconvertbot
New activity in
allenai/Llama-3.1-Tulu-3-70B-SFT
about 1 month ago
Adding Evaluation Results
#2 opened about 2 months ago by
leaderboard-pr-bot
New activity in
allenai/Llama-3.1-Tulu-3-8B-DPO
about 1 month ago
Adding `safetensors` variant of this model
#2 opened about 2 months ago by
SFconvertbot
New activity in
allenai/Llama-3.1-Tulu-3-70B-DPO
about 1 month ago
Adding `safetensors` variant of this model
#3 opened about 2 months ago by
SFconvertbot
New activity in
allenai/Llama-3.1-Tulu-3-70B
about 1 month ago
Spelling Error in Section 5.4 - "then" should be "than"
1
#3 opened about 2 months ago by
eliuakk
New activity in
allenai/Llama-3.1-Tulu-3-8B
about 1 month ago
Feedback
1
#2 opened about 2 months ago by
KeyboardMasher
New activity in
allenai/Llama-3.1-Tulu-3-8B-RM
about 2 months ago
Update README.md
#1 opened about 2 months ago by
reach-vb
New activity in
allenai/Llama-3.1-Tulu-3-70B-SFT
about 2 months ago
Update README.md
#1 opened about 2 months ago by
reach-vb
Load more