Yadong Mu

The code and pre-trained models for our new diffusion model Pyramid-Flow are released.
The project sites for our text-to-video generative model Video-LaVIT and text-to-3d scene generative model InstructScene are on. Paper, demos and part of the code are available.
We have released the code for vision-language foundation model LaVIT.(10/2023).
We have a number of papers accepted by CVPR, ICCV, ICML, WWW, ACMMM and NeurIPS. (10/2023)
Prof. Yadong Mu received IEEE TMM's Best Associate Editor Award in 2022. (12/2022).
Congratulations to Kangqi Ma et al for the 2nd prize of SAPIEN ManiSkill Challenge 2021 (no external data track). (2/2022).
Prof. Yadong Mu is appointed as an associate editor of IEEE Transactions on Multimedia. (8/2021).
1 paper is accepted by NeurIPS 2020. (9/2020).
5 papers are accepted by CVPR 2020. (2/2020).
Prof. Yadong Mu will serve as an Associate Editor of Neurocomputing (2/2020).
Prof. Yadong Mu will serve as an Area Chair of ACM Multimedia 2020 and CVPR 2021. (1/2020).
We won the second place in the "Temporal Localization" task in ActivityNet Challenge 2019. (7/2019).
In the spring semester of 2018, I will teach a new course "Computer Vision and Deep Learning" for undergraduate students in EECS, Peking University. (11/2017)
One paper collaborated with UESTC and Tencent AI Lab won the Best Paper Honourable Mentions at SIGIR 2017. [Link] (08/2017)
Call for Paper -- ACM Multimedia Workshop on Visual Analysis for Smart and Connected Communities. [Link] (06/2017)
I will teach a course advanced topics in computer vision" (course ID: 04802034), and co-teach the other course "deep learning" (course ID: 08408005) in the spring semester. The former will majorly discuss recent advances in computer vision and the latter will cover both deep learning theory and applications. (02/2017)
We won the first place out of 100+ teams in the "traffic sign detection in autonomous driving" competition (preliminary round) organized by China Computer Federation (CCF) and UISEE (a self-driving car startup). (11/2016)
Our team participated 2016 TRECVID MED (multimedia event detection) competition organized by National Institute of Standards and Technology (NIST). Our multi-modal MED system achieved top performance in three sub-tasks in the PS-100Ex setting. (10/2016)
Our team won the second place in RACV 2016 Iqiyi Video Annotation Challenge.
Prof. Yadong Mu will join the Institute of Computer Science and Technology, Peking University as a tenure track faculty and principal investigator. (05/2016)