Kyungmin Lee
I am PhD student at KAIST advised by professor Jinwoo Shin, and a research intern at NVIDIA GEAR hosted by Jim Fan and Yuke Zhu. I was a student researcher at Google DeepMind hosted by Yinxiao Li. I closely worked with Kihyuk Sohn via Google University Relations program.
My research interests lie in probabilistic machine learning, generative modeling, and representation learning, with a focus on their applications to visual understanding and generation. Recently, I have been particularly interested in generative modeling for visual synthesis β including image, video, and 3D generation β as a step toward world modeling. Also, I am exploring applications in robotics, especially utilizing video world-models for robotics policy. My previous works have focused on the fundamentals of training diffusion and flow models, as well as their adaptations to tasks such as 3D generation, personalization, and preference-based fine-tuning.
I plan to graduate in 2026 and am seeking industry research positions. Please feel free to reach out if youβre interested!
Email: kyungmnlee (at) kaist (dot) ac (dot) kr | Curriculum Vitae
π Publications
World Action Models are Zero-Shot Policies
Seonghyeon Ye,
Yunhao Ge,
Kaiyuan Zheng,
Shenyuan Gao,
Sihyun Yu,
George Kurian,
Suneel Indupuru,
You Liang Tan,
Chuning Zhu,
Jiannan Xiang,
Ayaan Malik,
Kyungmin Lee,
William Liang,
Nadun Ranawaka,
Jiasheng Gu,
Yinzhen Xu,
Guanzhi Wang,
Fengyuan Hu,
Avnish Narayan,
Johan Bjorck,
Jing Wang,
Gwanghyun Kim,
Dantong Niu,
Ruijie Zheng,
Yuqi Xie,
Jimmy Wu,
Qi Wang,
Ryan Julian,
Danfei Xu,
Yilun Du,
Yevgen Chebotar,
Scott Reed,
Jan Kautz,
Yuke Zhu,
Jim Fan,
Joel Jang
Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model
John Won,
Kyungmin Lee,
Huiwon Jang,
Dongyoung Kim,
Jinwoo Shin
Decoupled MeanFlow: Turning Flow Models into Flow Maps for Accelerated Sampling
Kyungmin Lee,
Sihyun Yu,
Jinwoo Shin
Calibrated Multi-Preference Optimization for Aligning Diffusion Models
Kyungmin Lee,
Xiaohang Li,
Qifei Wang,
Junfeng He,
Junjie Ke,
Ming-Hsuan Yang,
Irfan Essa,
Jinwoo Shin,
Feng Yang,
Yinxiao Li
Direct Consistency Optimization for Robust Customization of Text-to-Image Diffusion Models
Kyungmin Lee,
Sangkyung Kwak,
Kihyuk Sohn,
Jinwoo Shin
DreamFlow: High-quality Text-to-3D generation by Approximating Probability Flow
Kyungmin Lee,
Kihyuk Sohn,
Jinwoo Shin
Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Yisol Choi,
Sangkyung Kwak,
Kyungmin Lee,
Hyungwon Choi,
Jinwoo Shin
π» Work Experience
- 2025.12 - 2026.03, Research Intern @ NVIDIA GEAR, Remote.
- 2024.07 - 2024.12, Student Researcher @ Google DeepMind, Mountain View, CA, US.
- 2023.02 - 2024.03, University Relation Program @ Google Research, Remote.
π€ Academic Services
- Conference reviewer: NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, BMCV, WACV, AISTATS
- Journal reviewer: TMLR, TPAMI
π Educations
- 2022.09 - 2026.06, KAIST, Ph.D. in Artificial Intelligence (expected).
- 2015.03 - 2019.02, KAIST, B.S. in Mathematics, Electrical and Computer Engineering (double major).