arxiv:2501.02955
Zhuoyi Yang
zyyangzy
AI & ML interests
Multimodal learning
Recent Activity
authored
a paper
7 days ago
MotionBench: Benchmarking and Improving Fine-grained Video Motion
Understanding for Vision Language Models
authored
a paper
9 days ago
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning
for Image and Video Generation
authored
a paper
5 months ago
CogVLM2: Visual Language Models for Image and Video Understanding
Organizations
None yet