arxiv:2501.05452
Jianwei Yang
jw2yang
AI & ML interests
Computer Vision, Vision and Language, Machine Learning
Recent Activity
authored
a paper
2 days ago
ReFocus: Visual Editing as a Chain of Thought for Structured Image
Understanding
authored
a paper
28 days ago
TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for
Generalist Robotic Policies
authored
a paper
30 days ago
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary
Embedding Distillation
Organizations
Papers
18
models
None public yet
datasets
None public yet