-
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Paper • 2410.05243 • Published • 18 -
GPT-4V(ision) is a Generalist Web Agent, if Grounded
Paper • 2401.01614 • Published • 22 -
osunlp/UGround
Image-Text-to-Text • Updated • 2.35k • 19 -
osunlp/UGround-V1-2B
Image-Text-to-Text • Updated • 685 • 6
Boyu Gou
BoyuNLP
AI & ML interests
AI Agents, Foundation Models, GUI Agents
Recent Activity
liked
a model
about 21 hours ago
osunlp/UGround-V1-7B
liked
a model
about 21 hours ago
osunlp/UGround-V1-72B-Preview
liked
a model
about 21 hours ago
osunlp/UGround-V1-72B
Organizations
Collections
1
models
None public yet
datasets
None public yet