Multimedia Semantic Analytics Lab
Qingyu Shi
Latest
RecTok: Reconstruction Distillation along Rectified Flow
Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation
Muddit: Liberating generation beyond text-to-image with a unified discrete diffusion model
Decouple and track: Benchmarking and improving video diffusion transformers for motion transfer
DreamRelation: Bridging Customization and Relation Generation
RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything