rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 7 days ago • 218
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published 12 days ago • 78
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 78
cognitivecomputations/OpenCoder-LLM_opc-sft-stage1-DolphinLabeled Viewer • Updated 8 days ago • 3.01M • 46 • 6
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 16 days ago • 22
MahmoudAshraf/mms-300m-1130-forced-aligner Automatic Speech Recognition • Updated Sep 28, 2024 • 1.9M • 36
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search Paper • 2412.18319 • Published 22 days ago • 36