arxiv:2411.13676
Min-Hung Chen
cmhungsteve
AI & ML interests
Multimodal AI, Transfer Learning, Unsupervised Learning, Video Understanding, Vision Transformer, Computer Vision, Deep Learning
Recent Activity
upvoted
a
paper
about 6 hours ago
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token
Marks
commented on
a paper
about 6 hours ago
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token
Marks
upvoted
a
paper
about 2 months ago
Hymba: A Hybrid-head Architecture for Small Language Models
Organizations
Papers
24
models
None public yet
datasets
None public yet