WeihaoZeng

AndrewZeng

https://github.com/Zeng-WH

AI & ML interests

None yet

Recent Activity

authored a paper about 5 hours ago

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

upvoted a paper 5 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

upvoted a paper 7 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Organizations

Papers 8

models 3

datasets 60

AndrewZeng/math-bstar-sample

Viewer • Updated 22 days ago • 11.5k • 13

AndrewZeng/bstar-math-dev

Viewer • Updated 22 days ago • 604 • 38

AndrewZeng/prm-reward-data

Viewer • Updated 22 days ago • 240k • 31

AndrewZeng/math-trn-format

Viewer • Updated 22 days ago • 11.5k • 39

AndrewZeng/math_scaling

Viewer • Updated Oct 7, 2024 • 100 • 23

AndrewZeng/random_syn

Viewer • Updated Jun 14, 2024 • 108k • 12

AndrewZeng/medium_syn_mistral_20w_mistral_infer_part_4

Viewer • Updated Jun 13, 2024 • 38.9k • 31

AndrewZeng/medium_syn_mistral_20w_mistral_infer_part_3

Viewer • Updated Jun 13, 2024 • 38.9k • 33

AndrewZeng/medium_syn_mistral_20w_mistral_infer_part_2

Viewer • Updated Jun 13, 2024 • 38.9k • 29

AndrewZeng/medium_syn_mistral_20w_mistral_infer_part_1

Viewer • Updated Jun 13, 2024 • 38.9k • 28
