Nathan Lambert's picture

Nathan Lambert

natolambert

·

https://www.natolambert.com/

AI & ML interests

Reinforcement learning, Ethics, Robotics, Dynamics Models

Recent Activity

liked a model about 6 hours ago

ibm-granite/granite-3.1-2b-instruct

updated a collection about 6 hours ago

liked a model about 16 hours ago

MiniMaxAI/MiniMax-Text-01

View all activity

Articles

Ethics and Society Newsletter #4: Bias in Text-to-Image Models

Can foundation models label data like humans?

Creating a Coding Assistant with StarCoder

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Red-Teaming Large Language Models

What Makes a Dialog Agent Useful?

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Stable Diffusion with 🧨 Diffusers

Organizations

natolambert's activity

New activity in allenai/reward-bench 5 days ago

multilingual

#8 opened 11 days ago by

New activity in allenai/reward-bench about 1 month ago

add more contaminated models to the list

#7 opened 3 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B about 1 month ago

Reason behind not using special tokens in the prompt format?

#2 opened about 2 months ago by

New activity in allenai/OLMo-2-1124-13B-Instruct-preview about 1 month ago

What is that instruction template?

#1 opened about 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B about 1 month ago

Why do you use pass@10 to test coding perfmance...

#4 opened about 2 months ago by

New activity in allenai/OLMo-2-1124-13B-Instruct-preview about 1 month ago

Has the data set been expanded?

#2 opened about 2 months ago by

New activity in allenai/tulu-3-sft-personas-algebra about 1 month ago

Librarian Bot: Add language metadata for dataset

#1 opened about 2 months ago by

New activity in allenai/tulu-3-sft-personas-math about 1 month ago

Add link to Tulu 3 paper

#2 opened about 2 months ago by

New activity in allenai/llama-3.1-tulu-3-70b-preference-mixture about 1 month ago

Librarian Bot: Add language metadata for dataset

#1 opened about 2 months ago by

New activity in allenai/llama-3.1-tulu-3-8b-preference-mixture about 1 month ago

Easy way to separate permissive samples

#1 opened about 2 months ago by

New activity in allenai/tulu-3-sft-mixture about 1 month ago

recommend filter

#2 opened about 2 months ago by

NuminaMath-TIR License (Apache 2, not CC-BY-NC-4.0)

#3 opened about 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-8B-RM about 1 month ago

Adding `safetensors` variant of this model

#2 opened about 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B-SFT about 1 month ago

Adding Evaluation Results

#2 opened about 2 months ago by

leaderboard-pr-bot

New activity in allenai/Llama-3.1-Tulu-3-8B-DPO about 1 month ago

Adding `safetensors` variant of this model

#2 opened about 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B-DPO about 1 month ago

Adding `safetensors` variant of this model

#3 opened about 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B about 1 month ago

Spelling Error in Section 5.4 - "then" should be "than"

#3 opened about 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-8B about 1 month ago

Feedback

#2 opened about 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-8B-RM about 2 months ago

Update README.md

#1 opened about 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B-SFT about 2 months ago

Update README.md

#1 opened about 2 months ago by