Vaibhav Srivastav's picture

Vaibhav Srivastav PRO

reach-vb

·

https://vaibhavs10.github.io

AI & ML interests

TTS + LM performance prediction

Recent Activity

upvoted an article 4 minutes ago

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

upvoted an article 35 minutes ago

Diving into MiniMax01 405B MoE

new activity about 1 hour ago

MiniMaxAI/MiniMax-Text-01:Add Transformers as the library

View all activity

Articles

Faster Text Generation with Self-Speculative Decoding

Llama can now see and run on your device - welcome Llama 3.2

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

WWDC 24: Running Mistral 7B with Core ML

Welcome Gemma 2 - Google's new open LLM

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

CodeGemma - an official Google release for code LLMs

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

AI Watermarking 101: Tools and Techniques

Deploy MusicGen in no time with Inference Endpoints

Jupyter X Hugging Face

Swift Diffusers: Fast Stable Diffusion for Mac

Organizations

reach-vb's activity

upvoted an article 4 minutes ago

Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

By

•

34 minutes ago

• 4

upvoted an article 35 minutes ago

Article

Diving into MiniMax01 405B MoE

By

•

41 minutes ago

• 2

upvoted a paper 6 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 7 days ago • 218

upvoted a collection 7 days ago

Sa2VA model zoo

4 items • Updated about 7 hours ago • 23

upvoted a collection 8 days ago

Cosmos

The collection of Cosmos models • 31 items • Updated 4 days ago • 224

upvoted a paper 12 days ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published 15 days ago • 15

upvoted a collection 12 days ago

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 9 days ago • 74

upvoted 2 collections 13 days ago

Yi VL

2 items • Updated May 11, 2024 • 2

Falcon2

5 items • Updated 7 days ago • 5

upvoted 5 collections 15 days ago

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated 14 days ago • 40

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 110

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 461

Chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. • 2 items • Updated Jul 9, 2024 • 28

Stable Diffusion 3

Stable Diffusion 3 and related models for text-to-image and image-to-image • 2 items • Updated 6 days ago • 95

upvoted a collection 20 days ago

DeepSeek-V3

3 items • Updated 9 days ago • 114

upvoted 2 collections 23 days ago

NeMo Audio Codecs

A series of Neural Audio Codecs • 5 items • Updated 4 days ago • 10

InternVL2.5-MPO

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 5 days ago • 24

upvoted an article 23 days ago

Article

FineWeb2-C: Help Build Better Language Models in Your Language

By

•

23 days ago

• 12

upvoted a collection 26 days ago

📐 FineMath

FineMath datasets and ablation models • 14 items • Updated 9 days ago • 17

upvoted a collection 27 days ago

Bamba

Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 28 days ago • 18