Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability
Language Technology Lab at Alibaba DAMO Academy
company
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
5
spaces
3
models
42
DAMO-NLP-SG/VideoRefer-7B-stage2.5
Visual Question Answering
•
Updated
•
33
•
2
DAMO-NLP-SG/VideoRefer-7B-stage2
Visual Question Answering
•
Updated
•
13
•
1
DAMO-NLP-SG/VideoRefer-7B
Visual Question Answering
•
Updated
•
64
•
2
DAMO-NLP-SG/DiGIT
Unconditional Image Generation
•
Updated
•
4
DAMO-NLP-SG/VideoLLaMA2.1-7B-AV
Visual Question Answering
•
Updated
•
873
•
14
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F
Visual Question Answering
•
Updated
•
2.08k
•
8
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base
Visual Question Answering
•
Updated
•
708
•
1
DAMO-NLP-SG/LiT-B-32_CC12M
Updated
•
1
DAMO-NLP-SG/VideoLLaMA2-72B
Visual Question Answering
•
Updated
•
86
•
10
DAMO-NLP-SG/VideoLLaMA2-72B-Base
Visual Question Answering
•
Updated
•
24
•
1
datasets
9
DAMO-NLP-SG/multimodal_textbook
Updated
•
6.23k
•
99
DAMO-NLP-SG/VideoRefer-Bench
Updated
•
29
DAMO-NLP-SG/CMM
Updated
•
42
•
5
DAMO-NLP-SG/Multi-Source-Video-Captioning
Viewer
•
Updated
•
1.5k
•
59
•
6
DAMO-NLP-SG/LongCorpus-2.5B
Preview
•
Updated
•
36
•
8
DAMO-NLP-SG/SOUL
Viewer
•
Updated
•
15k
•
55
DAMO-NLP-SG/MultiJail
Viewer
•
Updated
•
315
•
48
•
6
DAMO-NLP-SG/HyperlinkMRC
Updated
•
38
•
2
DAMO-NLP-SG/SSTuning-datasets
Updated
•
34