VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published 5 days ago • 54
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 6 days ago • 41
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation Paper • 2501.03059 • Published 9 days ago • 19
TransPixar: Advancing Text-to-Video Generation with Transparency Paper • 2501.03006 • Published 9 days ago • 21
Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Paper • 2501.03931 • Published 8 days ago • 14
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Paper • 2501.04689 • Published 7 days ago • 16
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Paper • 2501.04001 • Published 8 days ago • 40
IDOL: Instant Photorealistic 3D Human Creation from a Single Image Paper • 2412.14963 • Published 27 days ago • 6
Large Motion Video Autoencoding with Cross-modal Video VAE Paper • 2412.17805 • Published 23 days ago • 24
WavePulse: Real-time Content Analytics of Radio Livestreams Paper • 2412.17998 • Published 23 days ago • 10
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization Paper • 2412.21037 • Published 16 days ago • 23
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration Paper • 2501.01320 • Published 13 days ago • 11
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Paper • 2501.01427 • Published 13 days ago • 46
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity Paper • 2412.09856 • Published Dec 13, 2024 • 10
Motion Control for Enhanced Complex Action Video Generation Paper • 2411.08328 • Published Nov 13, 2024 • 5
Number it: Temporal Grounding Videos like Flipping Manga Paper • 2411.10332 • Published Nov 15, 2024 • 14
TEXGen: a Generative Diffusion Model for Mesh Textures Paper • 2411.14740 • Published Nov 22, 2024 • 15