9 20 11

Wei Liu

PeterV09

https://vpeterv.github.io

AI & ML interests

Machine Learning, Natural Language Processing

Recent Activity

updated a model about 1 hour ago

RL4Reasoning/dart-math-prop2diff-v1

published a model about 1 hour ago

RL4Reasoning/dart-math-prop2diff-v1

updated a model about 9 hours ago

hkustnlpcot/Qwen-1.5B-v2-16384r

View all activity

Organizations

PeterV09's activity

updated a model about 1 hour ago

RL4Reasoning/dart-math-prop2diff-v1

Updated about 1 hour ago

published a model about 1 hour ago

RL4Reasoning/dart-math-prop2diff-v1

Updated about 1 hour ago

updated a model about 9 hours ago

hkustnlpcot/Qwen-1.5B-v2-16384r

Updated about 9 hours ago

updated a model about 10 hours ago

hkustnlpcot/Qwen-7B-v2-16384r

Updated about 10 hours ago

updated 2 models about 12 hours ago

hkustnlpcot/Qwen-7B-v1-8192r

Updated about 12 hours ago

hkustnlpcot/Qwen-7B-v1

Updated about 12 hours ago

liked a model about 13 hours ago

MiniMaxAI/MiniMax-Text-01

Text Generation • Updated about 3 hours ago • 132 • 254

updated 2 datasets 1 day ago

hkustnlpcot2/Math-Level-1-5

Viewer • Updated 1 day ago • 11.5k • 1

hkustnlpcot2/Math-Level-5

Viewer • Updated 1 day ago • 3.36k

upvoted 3 papers 7 days ago

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published 8 days ago • 40

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published 8 days ago • 61

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 12 days ago • 79

upvoted a paper 8 days ago

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Paper • 2501.03124 • Published 9 days ago • 13

upvoted a paper 13 days ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published 19 days ago • 78

updated a collection 21 days ago

M-STAR

Collection

Resources of M-STAR (Multimodal Self-Evolving Training for Reasoning) https://mstar-lmm.github.io/ • 2 items • Updated 21 days ago • 2

updated 2 models 21 days ago

hkust-nlp/mstar-prm-8b-v1.0

Updated 21 days ago • 11 • 1

hkust-nlp/mstar-8b-v1.0

Updated 21 days ago • 10 • 2

upvoted a paper 23 days ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 23 days ago • 42

commented a paper 23 days ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 23 days ago • 42 •