Multimedia Semantic Analytics Lab
Multimedia Semantic Analytics Lab
Home
People
Publications
Contact
Ye Tian
Latest
Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation
Vmoba: Mixture-of-block attention for video diffusion models
Mmada: Multimodal large diffusion language models
Training-free diffusion acceleration with bottleneck sampling
Diffusion-sharpening: Fine-tuning diffusion models with denoising trajectory sharpening
Cite
×