Multimodal Models Collection Multimodal models with leading performance. • 15 items • Updated 1 day ago • 25
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 7 days ago • 218
Riva Collection A family of Riva production (NVAIE) speech models that achieve state-of-the-art results on speech transcription, translation, and synthesis tasks. • 1 item • Updated 4 days ago • 3
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 17 days ago • 10
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper • 2412.17498 • Published 23 days ago • 21
InternVL2.5-MPO Collection Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 5 days ago • 24
TACO Models Collection This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated 26 days ago • 8
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 27 days ago • 123
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 28 days ago • 18