5 324

Literate Goggles

literate-goggles

AI & ML interests

None yet

Recent Activity

upvoted a paper about 17 hours ago

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

upvoted a paper about 22 hours ago

Tensor Product Attention Is All You Need

upvoted a paper about 23 hours ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

View all activity

Organizations

None yet

literate-goggles's activity

upvoted a paper about 17 hours ago

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Paper • 2405.00233 • Published Apr 30, 2024 • 15

upvoted a paper about 22 hours ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 4 days ago • 47

upvoted a paper about 23 hours ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 2 days ago • 62

upvoted a paper 2 days ago

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published 5 days ago • 56

upvoted 3 papers 3 days ago

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published 16 days ago • 33

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published 22 days ago • 68

Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 25

upvoted 2 papers 6 days ago

Perceiver: General Perception with Iterative Attention

Paper • 2103.03206 • Published Mar 4, 2021 • 1

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 7 days ago • 218

upvoted 5 papers 7 days ago

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning

Paper • 1907.04448 • Published Jul 9, 2019 • 1

Domain-Adversarial Training of Neural Networks

Paper • 1505.07818 • Published May 28, 2015 • 1

upvoted 4 papers 9 days ago

Graph Generative Pre-trained Transformer

Paper • 2501.01073 • Published 13 days ago • 17

SDPO: Segment-Level Direct Preference Optimization for Social Agents

Paper • 2501.01821 • Published 12 days ago • 18

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published 12 days ago • 38

Fewer-token Neural Speech Codec with Time-invariant Codes

Paper • 2310.00014 • Published Sep 15, 2023 • 2

upvoted a paper 10 days ago

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published 26 days ago • 17

upvoted a paper 11 days ago

PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models

Paper • 2412.18608 • Published 22 days ago • 14