Multimedia Semantic Analytics Lab
Tao Zhang
Latest
Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence
Mixed-R1: Unified Reward Perspective for Reasoning Capability in Multimodal Large Language Models
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos