arxiv:2410.19324
Thomas Mensink
tmensink
AI & ML interests
None yet
Recent Activity
authored
a paper
2 months ago
HAMMR: HierArchical MultiModal React agents for generic VQA
authored
a paper
2 months ago
Scaling Vision Transformers to 22 Billion Parameters
authored
a paper
over 1 year ago
Encyclopedic VQA: Visual questions about detailed properties of
fine-grained categories
Organizations
models
None public yet
datasets
None public yet