Weizhe Yuan

weizhey

AI & ML interests

NLP

Recent Activity

authored a paper about 2 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

weizhey's activity

upvoted a paper about 1 month ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 42

upvoted a paper about 2 months ago

Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published Nov 14, 2024 • 10

upvoted a paper 6 months ago

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 20

upvoted a paper 12 months ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 146

upvoted 2 papers over 1 year ago

System-Level Natural Language Feedback

Paper • 2306.13588 • Published Jun 23, 2023 • 10

reStructured Pre-training

Paper • 2206.11147 • Published Jun 22, 2022 • 1