Wen Wang | Zhejiang University
Ph.D. Student
Zhejiang University
wwenxyz (at) zju.edu.cn
About Me
Hi there! My name is Wen Wang (王文 in Chinese). I’m currently a Ph.D. student advised by Prof. Chunhua Shen at Zhejiang University, pursuing a Ph.D. in Computer Science. Prior to my doctoral studies, I obtained my master’s degree from the University of Science and Technology of China in 2022, fortunately supervised by Prof. Yang Cao.
Research Interests
- Multimodal Understanding: Advancing multimodal perception and reasoning.
- Multimodal Generation: Enabling the creation and editing of text, images, and videos.
News
- [Feb. 2026] Four papers, including HoloCine, Ditto, MagicQuillV2, and LivingSwap, are accepted to CVPR 2026 main conference.
- [Jan. 2026] Our paper dLLM-MidTruth and Sat3DGen are accepted to ICLR 2026.
- [Jan. 2026] Our paper FreerCustom is accepted to IJCV.
- [Nov. 2025] Our paper GUI-G2 is accepted to AAAI 2026.
- [Nov. 2025] Our paper LoRA-Composer is accepted to TIP.
- [Sep. 2025] Our paper Omni-R1 is accepted to NeurIPS 2025.
- [Feb. 2025] Four papers, including MagicQuill, AniDoc, LeviTor, and MovieBench, are accepted to CVPR 2025.
- [Jan. 2025] Two papers, including Framer and MovieDreamer, are accepted to ICLR 2025.
- [Dec. 2024] Our paper AutoStory is accepted to IJCV.
- [Jul. 2024] Our paper FreeCompose is accepted to ECCV 2024.
- [Feb. 2024] Our paper FreeCustom is accepted to CVPR 2024.
- [Jan. 2024] Our paper OIR-Diffusion is accepted to ICLR 2024.
- [Jul. 2023] Our paper SegGPT is accepted to ICCV 2023.
- [Feb. 2023] Three papers, including EVA, Painter, and CLAMP, are accepted to ICCV 2023.
Publications
-

Wen Wang, Kangyang Xie, Zide Liu, Hao Chen, Yue Cao, Xinlong Wang, and Chunhua Shen
Arxiv, 2023.
PDF Citations: 100+
-

Yuxin Fang, Wen Wang, Binhui Xie, Quan Sun, Ledell Yu Wu, Xinggang Wang, Tiejun Huang, Xinlong Wang, and Yue Cao
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
-
Wen Wang, Jing Zhang, Wei Zhai, Yang Cao, and Dacheng Tao
IEEE Transactions on Image Processing (TIP), 2022.
-
Yongliang Shen, Xinyin Ma, Zeqi Tan, Shuai Zhang, Wen Wang, Weiming Lu
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics, (ACL) 2021.
PDF Citations: 200+
-
Xinlong Wang, Xiaosong Zhang, Yue Cao, Wen Wang, Chunhua Shen, Tiejun Huang
IEEE/CVF International Conference on Computer Vision (ICCV), 2023.
PDF Citations: 500+
Multimodal Generation
Show More
Multimodal Understanding
Show More
Experiences
Research Intern at Ant Research
Apr. 2023 - Now
Research Intern at Beijing Academy of Artificial Intelligence
Jun. 2022 - Mar. 2023
Research Intern at JD Explore Academy
Dec. 2020 - Mar. 2022
Professional Activities
Conference Reviewers
- IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
- IEEE/CVF International Conference on Computer Vision (ICCV)
- European Conference on Computer Vision (ECCV)
- International Conference on Learning Representations (ICLR)
- Neural Information Processing Systems (NeurIPS)
- ACM Multimedia (ACM MM)
- The IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
Journal Reviewers
- IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
- IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
- The journal of Artificial Intelligence (AI)

