HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit Zero-Shot Image Classification • Updated Mar 7, 2024 • 7.03k • 43
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding Paper • 2501.05452 • Published 6 days ago • 12
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 6 days ago • 74
timm/vit_base_patch16_clip_224.openai Image Feature Extraction • Updated Oct 23, 2024 • 338k • 6
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Paper • 2501.01427 • Published 13 days ago • 47
xinyu1205/recognize-anything-plus-model Zero-Shot Image Classification • Updated Oct 25, 2023 • 37