Jan Kautz

Publications

2025

NeurIPS

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

G. Chen, Z. Li, S. Wang, J. Jiang, Y. Liu, L. Lu, D.-A. Huang, W. Byeon, M. Le, T. Rintamaki, T. Poon, M. Ehrlich, T. Lu, L. Wang, B. Catanzaro, J. Kautz, A. Tao, Z. Yu, G. Liu

Advances in Neural Information Processing Systems (NeurIPS)

December 2025

NeurIPS

CLIMB: Clustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

S. Diao, Y. Yang, Y. Fu, X. Dong, D. Su, M. Kliegl, Z. Chen, P. Belcak, Y. Suhara, H. Yin, M. Patwary, Y. C. Lin, J. Kautz, P. Molchanov

Advances in Neural Information Processing Systems (NeurIPS)

December 2025

NeurIPS

GSPN-2: Efficient Parallel Sequence Modeling

H. Wang, Y. Liang, D. Wehr, H. Ye, X. Li, K. C. Cheung, K. Han, H. Yin, P. Molchanov, S. Liu, W. Byeon, C. McCarthy, J. Gu, J. Kautz, K. Chen

Advances in Neural Information Processing Systems (NeurIPS)

December 2025

NeurIPS

Scaling RL to Long Videos

Y. Chen, W. Huang, B. Shi, Q. Hu, H. Ye, L. Zhu, Z. Liu, P. Molchanov, J. Kautz, X. Qi, S. Liu, H. Yin, Y. Lu, S. Han

Advances in Neural Information Processing Systems (NeurIPS)

December 2025

NeurIPS

Fast-SLM: Towards Latency-Optimal Hybrid Small Language Models

Y. Fu, X. Dong, S. Diao, M. V. Keirsbilck, H. Ye, W. Byeon, Y. Karnati, L. Liebenwein, M. Khadkevich, A. Keller, J. Kautz, Y. C. Lin, P. Molchanov

Advances in Neural Information Processing Systems (NeurIPS)

December 2025

NeurIPS

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

A. Taghibakhshi, S. T. Sreenivas, S. Muralidharan, M. Chochowski, Y. Karnati, R. B. Joshi, A. S. Mahabaleshwarkar, Z. Chen, Y. Suhara, O. Olabiyi, D. Korzekwa, M. Patwary, M. Shoeybi, J. Kautz, B. Catanzaro

Advances in Neural Information Processing Systems (NeurIPS)

December 2025

NeurIPS

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Advances in Neural Information Processing Systems (NeurIPS)

December 2025

ICCV

AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion

Y. Huang, Y. Yuan, X. Li, J. Kautz, U. Iqbal

IEEE International Conference on Computer Vision (ICCV)

October 2025

ICCV

GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion

G. Kim, X. Li, Y. Yuan, K. Nagano, T. Li, J. Kautz, S. Y. Chun, U. Iqbal

IEEE International Conference on Computer Vision (ICCV)

October 2025

ICCV

HumanOLAT: A Large-Scale Dataset for Full-Body Human Relighting and Novel-View Synthesis

T. Teufel, X. Zhou, U. Iqbal, P. Rao, P. Gera, J. Kautz, V. Golyanik, C. Theobalt

IEEE International Conference on Computer Vision (ICCV)

October 2025

ICCV

GEM: A GENeralist Model for Human MOtion

J. Li, J. Cao, H. Zhang, D. Rempe, J. Kautz, U. Iqbal, Y. Yuan

IEEE International Conference on Computer Vision (ICCV)

October 2025

CoRL

DreamGen: Unlocking Generalization in Robot Learning through Video World Models

J. Jang, S. Ye, Z. Lin, J. Xiang, J. Bjorck, Y. Fang, F. Hu, S. Huang, K. Kundalia, L. Magne, A. Mandlekar, A. Narayan, Y. L. Tan, G. Wang, J. Wang, Q. Wang, Y. Xu, K. Zheng, R. Zheng, L. Zettlemoyer, D. Fox, J. Kautz, S. Reed, Y. Zhu, L. Fan

Conference on Robot Learning (CoRL)

September 2025

CoRL

FLARE: Robot Learning with Implicit World Modeling

R. Zheng, J. Wang, S. Reed, Y. Fang, F. Hu, J. Jang, K. Kundalia, Z. Lin, L. Magne, A. Narayan, Y. L. Tan, G. Wang, Q. Wang, J. Xiang, Y. Xu, S. Ye, J. Kautz, F. Huang, Y. Zhu, L. Fan

Conference on Robot Learning (CoRL)

September 2025

TMLR

Wolf: Dense Video Captioning with a World Summarization Framework

B. Li, L. Zhu, R. Tian, S. Tan, Y. Chen, Y. Lu, Y. Cui, S. Veer, M. Ehrlich, J. Philion, X. Weng, F. Xue, L. Fan, Y. Zhu, J. Kautz, A. Tao, M.-Y. Liu, S. Fidler, B. Ivanovic, T. Darrell, J. Malik, S. Han, M. Pavone

Transactions on Machine Learning Research

September 2025

ArXiV

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

NVIDIA

ArXiV

August 2025

ICML

FeatSharp: Your Vision Model Features, Sharper

M. Ranzinger, G. Heinrich, P. Molchanov, J. Kautz, B. Catanzaro, A. Tao

International Conference on Machine Learning (ICML)

July 2025

ICML

LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models

D. Shi, Y. Fu, X. Yuan, Z. Yu, H. You, S. Li, X. Dong, J. Kautz, P. Molchanov, Y. C. Lin

International Conference on Machine Learning (ICML)

July 2025

CVPR

One-Minute Video Generation with Test-Time Training

J. Xu, S. Han, K. Dalal, D. Koceja, X. Li, Y. Zhao, K. C. Cheung, Y. Choi, J. Kautz, S. Liu, Y. Sun, X. Wang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025

CVPR

Parallel Sequence Modeling via Generalization Spatial Propagation Network (GSPN)

H. Wang, W. Byeon, J. Xu, J. Gu, K. C. Cheung, X. Wang, K. Han, J. Kautz, S. Liu

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025

CVPR

Scaling Vision Pre-Training to 4K Resolution

B. Shi, B. Li, H. Cai, Y. Lu, S. Liu, M. Pavone, J. Kautz, S. Han, T. Darrell, P. Molchanov, H. Yin

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025 (highlight)

CVPR

RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models

G. Heinrich, M. Ranzinger, H. Yin, Y. Lu, J. Kautz, B. Catanzaro, A. Tao, P. Molchanov

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025

CVPR

FoundationStereo: Zero-Shot Stereo Matching

B. Wen, M. Trepte, O. J. Aribido, J. Kautz, O. Gallo, S. Birchfield

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025 (best paper award candidate, oral)

CVPR

Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought

Y. Man, D.-A. Huang, G. Liu, S. Sheng, S. Liu, L. Gui, J. Kautz, Y.-X. Wang, Z. Yu

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025

CVPR

NVILA: Efficient Frontier Visual Language Models

Z. Liu, L. Zhu, B. Shi, Z. Zhang, Y. Lou, S. Yang, H. Xi, S. Cao, Y. Gu, D. Li, X. Li, Y. Fang, Y. Chen, C.-Y. Hsieh, D.-A. Huang, A.-C. Cheng, V. Nath, A. Myronenko, J. Hu, S. Liu, R. Krishna, D. Xu, X. Wang, P. Molchanov, J. Kautz, H. Yin, S. Han, a. Y. Lu

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025

CVPR

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation

J. Lee, C. Park, J. Choe, Y.-C. F. Wang, J. Kautz, M. Cho, C. Choy

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025

CVPR

SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing

X. Li, Y. Yuan, S. D. Mello, G. Daviet, J. Leaf, M. Macklin, J. Kautz, U. Iqbal

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025

CVPR

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counter Factual Reasoning

S. Wang, Z. Yu, X. Jiang, S. Lan, M. Shi, N. Chang, J. Kautz, Y. Li, J. M. Alvarez

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025

CVPR

MambaVision: A Hybrid Mamba-Transformer Vision Backbone

A. Hatamizadeh, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2025

RSS

NaVILA: Legged Robot Vision-Language-Action Model for Navigation

A.-C. Cheng, Y. Ji, Z. Yang, Z. Gongye, X. Zou, J. Kautz, E. Biyik, H. Yin, S. Liu, X. Wang

Robotics: Science and Systems (RSS)

June 2025

ICRA

HOVER: Versatile Neural Whole-Body Controller for Humanoid Robots

T. He, W. Xiao, T. Lin, Z. Luo, Z. Xu, Z. Jiang, J. Kautz, C. Liu, G. Shi, X. Wang, L. “. Fan, Y. Zhu

International Conference on Robotics and Automation (ICRA)

May 2025

ArXiV

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

NVIDIA

ArXiV

April 2025

ICLR

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

M. Shi, F. Liu, S. Wang, S. Liao, S. Radhakrishnan, D.-A. Huang, H. Yin, K. Sapra, Y. Yacoob, H. Shi, B. Catanzaro, A. Tao, J. Kautz, G. Liu, Z. Yu

International Conference on Learning Representations (ICLR)

April 2025

ICLR

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Y. Chen, F. Xue, D. Li, Q. Hu, L. Zhu, X. Li, Y. Fang, H. Tang, S. Yang, Z. Liu, Y. He, H. Yin, P. Molchanov, J. Kautz, L. Fan, Y. Zhu, Y. Lu, S. Han

International Conference on Learning Representations (ICLR)

April 2025

ICLR

Gated Delta Networks: Improving Mamba2 with Delta Rule

S. Yang, J. Kautz, A. Hatamizadeh

International Conference on Learning Representations (ICLR)

April 2025

ICLR

Hymba: A Hybrid-head Architecture for Small Language Models

X. Dong, Y. Fu, S. Diao, W. Byeon, Z. Chen, A. S. Mahabaleshwarkar, S.-Y. Liu, M. V. Keirsbilck, M.-H. Chen, Y. Suhara, Y. C. Lin, J. Kautz, P. Molchanov

International Conference on Learning Representations (ICLR)

April 2025

ICLR

LlamaFlex: Many-in-One LLMs via Generalized Pruning and Weight Sharing

R. Cai, S. Muralidharan, H. Yin, Z. Wang, J. Kautz, P. Molchanov

International Conference on Learning Representations (ICLR)

April 2025

ICLR

LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement

Z. Ye, K. Xia, Y. Fu, X. Dong, J. Hong, X. Yuan, S. Diao, J. Kautz, P. Molchanov, Y. Lin

International Conference on Learning Representations (ICLR)

April 2025

ArXiV

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

NVIDIA

ArXiV

March 2025

Nature

Residual corrective diffusion modeling for km-scale atmospheric downscaling

M. Mardani, N. Brenowitz, Y. Cohen, J. Pathak, C.-Y. Chen, C.-C. Liu, A. Vahdat, M. A. Nabian, T. Ge, A. Subramaniam, K. Kashinath, J. Kautz, M. Pritchard

Nature Communications Earth & Environment

February 2025

2024

NeurIPS

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

G. Fang, H. Yin, S. Muralidharan, G. Heinrich, J. Pool, J. Kautz, P. Molchanov, X. Wang

Advances in Neural Information Processing Systems (NeurIPS)

December 2024

NeurIPS

Compact Language Models via Pruning and Knowledge Distillation

S. Muralidharan, S. T. Sreenivas, R. Joshi, M. Chochowski, M. Patwary, M. Shoeybi, B. Catanzaro, J. Kautz, P. Molchanov

Advances in Neural Information Processing Systems (NeurIPS)

December 2024

NeurIPS

SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models

A.-C. Cheng, H. Yin, Y. Fu, Q. Guo, R. Yang, J. Kautz, X. Wang, S. Liu

Advances in Neural Information Processing Systems (NeurIPS)

December 2024

NeurIPS

CosAE: Learnable Fourier Series for Image Restoration

S. Liu, S. D. Mello, J. Kautz

Advances in Neural Information Processing Systems (NeurIPS)

December 2024

ECCV

COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation

J. Li, Y. Yuan, D. Rempe, H. Zhang, P. Molchanov, C. Lu, J. Kautz, U. Iqbal

European Conference on Computer Vision (ECCV)

September 2024

ECCV

LITA: Language Instructed Temporal-localization Assistant

D.-A. Huang, S. Liao, S. Radhakrishnan, H. Yin, P. Molchanov, Z. Yu, J. Kautz

European Conference on Computer Vision (ECCV)

September 2024

ECCV

DiffiT: Diffusion Vision Transformers for Image Generation

A. Hatamizadeh, J. Song, G. Liu, J. Kautz, A. Vahdat

European Conference on Computer Vision (ECCV)

September 2024

ICML

Flextron: Many-in-One Flexible Large Language Model

R. Cai, S. Muralidharan, G. Heinrich, H. Yin, Z. Wang, J. Kautz, P. Molchanov

International Conference on Machine Learning (ICML)

July 2024 (oral)

ArXiV

An Empirical Study of Mamba-based Language Models

R. Waleffe, W. Byeon, D. Riach, B. Norick, V. Korthikanti, T. Dao, A. Gu, A. Hatamizadeh, S. Singh, D. Narayanan, G. Kulshreshtha, V. Singh, J. Casper, J. Kautz, M. Shoeybi, B. Catanzaro

ArXiV

June 2024

CVPR

FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

B. Wen, W. Yang, J. Kautz, S. Birchfield

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2024 (highlight)

CVPR

GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning

Y. Yuan, X. Li, Y. Huang, S. D. Mello, K. Nagano, J. Kautz, U. Iqbal

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2024 (highlight)

CVPR

COLMAP-Free 3D Gaussian Splatting

Y. Fu, S. Liu, A. Kulkarni, J. Kautz, A. A. Efros, X. Wang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2024 (highlight)

CVPR

VILA: On pretraining for vision language models

J. Lin, H. Yin, W. Ping, Y. Lu, P. Molchanov, A. Tao, H. Mao, J. Kautz, M. Shoeybi, S. Han

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2024

CVPR

AM-RADIO: Agglomerative Model - Reduce All Domains Into One

M. Ranzinger, G. Heinrich, P. Molchanov, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2024

CVPR

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?

Z. Li, Z. Yu, S. Lan, J. Li, J. Kautz, T. Lu, J. M. Alvarez

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2024

ICLR

FasterViT: Fast Vision Transformers with Hierarchical Attention

A. Hatamizadeh, G. Heinrich, H. Yin, A. Tao, J. M. Alvarez, J. Kautz, P. Molchanov

International Conference on Learning Representations (ICLR)

May 2024

ICLR

3D Reconstruction with Generalizable Neural Fields using Scene Priors

Y. Fu, S. D. Mello, X. Li, A. Kulkarni, J. Kautz, X. Wang, S. Liu

International Conference on Learning Representations (ICLR)

May 2024

ICLR

Learning to Jointly Understand Visual and Tactile Signals

Y. Li, Y. Du, C. Liu, F. Williams, M. Foshey, B. Eckart, J. Kautz, J. B. Tenenbaum, A. Torralba, W. Matusik

International Conference on Learning Representations (ICLR)

May 2024

ICLR

A Variational Perspective on Solving Inverse Problems with Diffusion Models

M. Mardani, J. Song, J. Kautz, A. Vahdat

International Conference on Learning Representations (ICLR)

May 2024

3DV

Field-of-View Agnostic Depth Estimation for Cross-Dataset Generalization

D. Lichy, H. Su, A. Badki, J. Kautz, O. Gallo

International Conference on 3D Vision

March 2024 (oral)

3DV

PACE: Human and Camera Motion Estimation from in-the-wild Videos

M. Kocabas, Y. Yuan, P. Molchanov, Y. Guo, M. Black, O. Hilliges, J. Kautz, U. Iqbal

International Conference on 3D Vision

March 2024

2023

NeurIPS

Generalizable One-shot Neural Head Avatar

X. Li, S. D. Mello, S. Liu, K. Nagano, U. Iqbal, J. Kautz

Advances in Neural Information Processing Systems (NeurIPS)

December 2023

NeurIPS

Convolutional State Space Models for Long-Range Spatiotemporal Modeling

J. T. Smith, S. D. Mello, J. Kautz, S. Linderman, W. Byeon

Advances in Neural Information Processing Systems (NeurIPS)

December 2023

MICCAI

SMRD: SURE-based Robust MRI Reconstruction with Diffusion Models

B. Ozturkler, C. Liu, B. Eckart, M. Mardani, J. Song, J. Kautz

International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI)

October 2023

ICCV

RANA: Relightable and Articulated Neural Avatars

U. Iqbal, A. Caliskan, K. Nagano, S. Khamis, P. Molchanov, J. Kautz

IEEE International Conference on Computer Vision (ICCV)

October 2023

ICCV

PhysDiff: Physics-Guided Human Motion Diffusion Model

Y. Yuan, J. Song, U. Iqbal, A. Vahdat, J. Kautz

IEEE International Conference on Computer Vision (ICCV)

October 2023 (oral)

SIGGRAPH

Single-Shot Implicit Morphable Faces with Consistent Texture Parameterization

C. Lin, K. Nagano, J. Kautz, E. Chan, U. Iqbal, L. Guibas, G. Wetzstein, S. Khamis

ACM SIGGRAPH

August 2023

ICML

Global Context Vision Transformers

A. Hatamizadeh, H. Yin, J. Kautz, P. Molchanov

International Conference on Machine Learning (ICML)

July 2023

ICML

Loss-Guided Diffusion Models for Plug-and-Play Controllable Generation

J. Song, Q. Zhang, H. Yin, M. Mardani, M.-Y. Liu, J. Kautz, Y. Chen, A. Vahdat

International Conference on Machine Learning (ICML)

July 2023

CVPR

Heterogeneous Continual Learning

D. Madaan, H. Yin, W. Byeon, J. Kautz, P. Molchanov

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

June 2023 (highlight)

CVPR

Zero-shot Pose Transfer for Unrigged Stylized 3D Characters

J. Wang, X. Li, S. Liu, S. D. Mello, O. Gallo, X. Wang, J. Kautz

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

June 2023

CVPR

The Best Defense is a Good Offense: Adversarial Augmentation Against Adversarial Attacks

I. Frosio, J. Kautz

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

June 2023

CVPR

Global Vision Transformer Pruning with Hessian-Aware Saliency

H. Yang, H. Yin, M. Shen, P. Molchanov, H. Li, J. Kautz

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

June 2023

CVPR

Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models

P. Micaelli, P. Molchanov, A. Vahdat, H. Yin, J. Kautz

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

June 2023

CVPR

BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects

B. Wen, J. Tremblay, V. Blukis, S. Tyree, T. Müller, A. Evans, D. Fox, J. Kautz, S. Birchfield

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

June 2023

ICLR

Pseudoinverse-Guided Diffusion Models for Inverse Problems

J. Song, A. Vahdat, M. Mardani, J. Kautz

International Conference on Learning Representations (ICLR), 2023

May 2023

ICRA

Online Consistent Video Depth using Continuous Geometric Representations

C. Liu, B. Eckart, J. Kautz

IEEE International Conference on Robotics and Automation (ICRA)

May 2023

IEEE TMI

Do Gradient Inversion Attacks Make Federated Learning Unsafe?

H. Roth, A. Hatamizadeh, H. Yin, P. Molchanov, A. Myronenko, W. Li, P. Dogra, A. Feng, M. Flores, J. Kautz, D. Xu

IEEE Transactions on Medical Imaging

42(7), January 2023

2022

SIGGRAPH ASIA

Learning to Relight Portrait Images via a Virtual Light Stage and Synthetic-to-Real Adaptation

Y.-Y. Yeh, K. Nagano, S. Khamis, J. Kautz, M.-Y. Liu, T.-C. Wang

ACM Transactions on Graphics (Proceedings SIGGRAPH Asia 2022)

41(6), December 2022

MIA

Towards Annotation-efficient Segmentation via Image-to-image Translation

E. Vorontsov, P. Molchanov, M. Gazda, C. Beckham, J. Kautz, S. Kadoury

Medical Image Analysis

82, November 2022

ECCV

LANA: Latency Aware Network Acceleration

P. Molchanov, J. Hall, H. Yin, N. Fusi, J. Kautz, A. Vahdat

European Conference on Computer Vision (ECCV)

October 2022

ECCV

Neural Light Field Estimation for Outdoor Scenes with Differentiable Virtual Object Insertion

Z. Wang, W. Chen, D. Acuna, J. Kautz, S. Fidler

European Conference on Computer Vision (ECCV)

October 2022

CVPR

GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras

Y. Yuan, U. Iqbal, P. Molchanov, K. Kitani, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2022 (oral)

CVPR

A-ViT: Adaptive Tokens for Efficient Vision Transformer

H. Yin, A. Vahdat, J. M. Alvarez, A. Mallya, J. Kautz, P. Molchanov

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2022 (oral)

CVPR

GradViT: Gradient Inversion of Vision Transformers

A. Hatamizadeh, H. Yin, H. Roth, W. Li, J. Kautz, D. Xu, P. Molchanov

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2022

CVPR

GroupViT: Zero-Shot Transfer to Semantic Segmentation with Text Supervision

J. Xu, S. D. Mello, S. Liu, W. Byeon, T. Breuel, J. Kautz, X. Wang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2022

CVPR

CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs

J. Mu, S. Liu, S. D. Mello, Z. Yu, N. Vasconcelos, X. Wang, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2022

CVPR

FreeSOLO: Learning to Segment Objects without Annotations

X. Wang, Z. Yu, S. D. Mello, J. Kautz, A. Anandkumar, C. Shen, J. M. Alvarez

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2022

ICLR

Learning Continuous Environment Fields via Implicit Functions

X. Li, S. D. Mello, X. Wang, M.-H. Yang, J. Kautz, S. Liu

International Conference on Learning Representations (ICLR)

April 2022

IJCV

Learning Contrastive Representation for Semantic Correspondence

T. Xiao, S. Liu, S. D. Mello, Z. Yu, J. Kautz, M.-H. Yang

International Journal on Computer Vision (IJCV)

March 2022

IJCV

Displacement-Invariant Cost Computation for Stereo Matching

Y. Zhong, C. Loop, W. Byeon, S. Birchfield, Y. Dai, K. Zhang, A. Kamenev, T. Breuel, H. Li, J. Kautz

International Journal on Computer Vision (IJCV)

March 2022

AAAI

Neural Interferometry: Image Reconstruction from Astronomical Interferometers using Implicit Neural Representations

B. Wu, B. Eckart, C. Liu, J. Kautz

AAAI Conference on Artificial Intelligence (AAAI)

February 2022

2021

NeurIPS

Coupled Segmentation and Edge Learning Using Dynamic Graph Propagation

Z. Yu, R. Huang, W. Byeon, S. Liu, G. Liu, T. Breuel, A. Anandkumar, J. Kautz

Neural Information Processing Systems (NeurIPS)

December 2021

NeurIPS

A Contrastive Learning Approach for Training Variational Autoencoder Priors

J. Aneja, A. Schwing, J. Kautz, A. Vahdat

Neural Information Processing Systems (NeurIPS)

December 2021

NeurIPS

Score-based Generative Modeling in Latent Space

A. Vahdat, K. Kreis, J. Kautz

Neural Information Processing Systems (NeurIPS)

December 2021

3DV

KAMA: 3D Keypoint Aware Body Mesh Articulation

U. Iqbal, K. Xie, Y. Guo, J. Kautz, P. Molchanov

International Conference on 3D Vision (3DV)

December 2021

BMVC

Hierarchical Contrastive Motion Learning for Video Action Recognition

X. Yang, X. Yang, S. Liu, D. Sun, L. Davis, J. Kautz

British Machine Vision Conference (BMVC)

November 2021

ICCV

Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting

Z. Wang, J. Philion, S. Fidler, J. Kautz

International Conference on Computer Vision (ICCV)

October 2021 (oral)

ICCV

Self-Supervised Object Detection via Generative Image Synthesis

S. K. Mustikovela, S. De Mello, A. Prakash, U. Iqbal, S. Liu, T. Nguyen-Phuoc, C. Rother, J. Kautz

International Conference on Computer Vision (ICCV)

October 2021

TPAMI

Domain Stylization: A Fast Covariance Matching Framework towards Domain Adaptation

A. Dundar, M.-Y. Liu, Z. Yu, T.-C. Wang, J. Zedlewski, J. Kautz

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

43(7), July 2021

CVPR

Binary TTC: A Temporal Geofence for Autonomous Navigation

A. Badki, O. Gallo, J. Kautz, P. Sen

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2021 (Best Student Paper Honorable Mention & oral)

CVPR

Weakly-Supervised Physically Unconstrained Gaze Estimation

R. Kothari, S. De Mello, U. Iqbal, W. Byeon, S. Park, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2021 (oral)

CVPR

Learning to Track Instances without Video Annotations

Y. Fu, S. Liu, U. Iqbal, S. De Mello, H. Shi, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2021 (oral)

CVPR

See Through Gradients: Image Batch Recovery via GradInversion

H. Yin, A. Mallya, A. Vahdat, J. M. Alvarez, J. Kautz, P. Molchanov

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2021

CVPR

Self-Supervised Learning on 3D Point Clouds by Learning Latent Generative Models

B. Eckart, W. Yuan, C. Liu, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2021

CVPR

DexYCB: A Benchmark for Capturing Hand Grasping of Objects

Y.-W. Chao, W. Yang, A. Handa, Y. Xiang, Y. Narang, K. V. Wyk, U. Iqbal, P. Molchanov, J. Tremblay, S. Birchfield, J. Kautz, D. Fox

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2021

ICLR

VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models

Z. Xiao, K. Kreis, J. Kautz, A. Vahdat

International Conference on Learning Representations (ICLR)

May 2021 (spotlight)

ICLR

Parameter Efficient Multimodal Transformers for Video Representation Learning

S. Lee, Y. Yu, G. Kim, T. Breuel, J. Kautz, Y. Song

International Conference on Learning Representations (ICLR)

May 2021

2020

NeurIPS

NVAE: A Deep Hierarchical Variational Autoencoder

A. Vahdat, J. Kautz

Neural Information Processing Systems (NeurIPS)

December 2020 (spotlight)

NeurIPS

Online Adaptation for Consistent Mesh Reconstruction in the Wild

X. Li, S. Liu, S. De Mello, K. Kim, X. Wang, M.-H. Yang, J. Kautz

Neural Information Processing Systems (NeurIPS)

December 2020

NeurIPS

Convolutional Tensor-Train LSTM for Spatio-Temporal Learning

J. Su, W. Byeon, J. Kossaifi, F. Huang, J. Kautz, A. Anandkumar

Neural Information Processing Systems (NeurIPS)

December 2020

ISMAR

Optical Gaze Tracking with Spatially-Sparse Single-Pixel Detectors

R. Li, E. Whitmire, M. Stengel, B. Boudaoud, J. Kautz, D. Luebke, S. Patel, K. Aksit

IEEE International Symposium on Mixed and Augmented Reality (ISMAR)

November 2020

ECCV

Contrastive Learning for Weakly Supervised Phrase Grounding

T. Gupta, A. Vahdat, G. Chechik, X. Yang, J. Kautz, D. Hoiem

European Conference on Computer Vision (ECCV)

August 2020 (spotlight)

ECCV

DeepGMR: Learning Latent Gaussian Mixture Models for Registration

W. Yuan, B. Eckart, K. Kim, V. Jampani, D. Fox, J. Kautz

European Conference on Computer Vision (ECCV)

August 2020 (spotlight)

ECCV

Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification

Y. Zou, X. Yang, Z. Yu, B. V. K. V. Kumar, J. Kautz

European Conference on Computer Vision (ECCV)

August 2020 (oral)

ECCV

Self-supervised Single-view 3D Reconstruction via Semantic Consistency

X. Li, S. Liu, K. Kim, S. De Mello, V. Jampani, M.-H. Yang, J. Kautz

European Conference on Computer Vision (ECCV)

August 2020

ECCV

Weakly Supervised 3D Hand Pose Estimation via Biomechanical Constraints

A. Spurr, P. Molchanov, U. Iqbal, O. Hilliges, J. Kautz

European Conference on Computer Vision (ECCV)

August 2020

ECCV

UFO2: A Unified Framework towards Omni-supervised Object Detection

Z. Ren, Z. Yu, X. Yang, M.-Y. Liu, A. Schwing, J. Kautz

European Conference on Computer Vision (ECCV)

August 2020

ICML

Angular Visual Hardness

B. Chen, W. Liu, A. Garg, Z. Yu, A. Shrivastava, J. Kautz, A. Anandkumar

International Conference on Machine Learning (ICML)

July 2020

CVPR

Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion

H. Yin, P. Molchanov, J. M. Alvarez, Z. Li, A. Mallya, D. Hoiem, N. Jha, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2020 (oral)

CVPR

UNAS: Differentiable Architecture Search Meets Reinforcement Learning

A. Vahdat, A. Mallya, M.-Y. Liu, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2020 (oral)

CVPR

Self-Supervised Viewpoint Learning from Image Collections

S. K. Mustikovela, V. Jampani, S. De Mello, U. Iqbal, S. Liu, C. Rother, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2020

CVPR

Bi3D: Stereo Depth Estimation via Binary Classifications

A. Badki, O. Gallo, A. Troccoli, K. Kim, P. Sen, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2020

CVPR

Meshlet Priors for 3D Mesh Reconstruction

A. Badki, O. Gallo, P. Sen, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2020

CVPR

Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild

U. Iqbal, P. Molchanov, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2020

CVPR

Two-shot Spatially-varying BRDF and Shape Estimation

M. Boss, V. Jampani, K. Kim, H. Lensch, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2020

CVPR

Novel View Synthesis of Dynamic Scenes with Globally Coherent Depths from a Monocular Camera

J. S. Yoon, K. Kim, O. Gallo, H. S. Park, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2020

CVPR

Instance-aware, Context-focused, and Memory-efficient Weakly-Supervised Object Detection

Z. Ren, Z. Yu, X. Yang, M.-Y. Liu, Y. J. Lee, A. Schwing, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2020

IJCV

Exploiting Semantics for Face Image Deblurring

Z. Shen, W.-S. Lai, T. Xu, J. Kautz, M.-H. Yang

International Journal on Computer Vision (IJCV)

March 2020

WACV

NRMVS: Non-Rigid Multi-View Stereo

M. Innmann, K. Kim, J. Gu, M. Niessner, C. Loop, M. Stamminger, J. Kautz

IEEE Winter Conference on Applications of Computer Vision (WACV)

March 2020, pages 2754-2763

2019

NeurIPS

Joint-task Self-supervised Learning for Temporal Correspondence

X. Li, S. Liu, S. De Mello, X. Wang, M.-H. Yang, J. Kautz

Neural Information Processing Systems (NeurIPS)

December 2019

NeurIPS

Dancing to Music

H.-Y. Lee, X. Yang, M.-Y. Liu, T.-C. Wang, Y.-D. Lu, M.-H. Yang, J. Kautz

Neural Information Processing Systems (NeurIPS)

December 2019

NeurIPS

Few-shot Video-to-Video Synthesis

T.-C. Wang, M.-Y. Liu, A. Tao, G. Liu, J. Kautz, B. Catanzaro

Neural Information Processing Systems (NeurIPS)

December 2019

ICCV

Extreme View Synthesis

I. Choi, O. Gallo, A. Troccoli, M. H. Kim, J. Kautz

IEEE International Conference on Computer Vision (ICCV)

October 2019 (oral)

ICCV

SENSE: A Shared Encoder Network for Scene-flow Estimation

H. Jiang, D. Sun, V. Jampani, Z. Lv, E. Learned-Miller, J. Kautz

IEEE International Conference on Computer Vision (ICCV)

October 2019 (oral)

ICCV

Few-shot Adaptive Gaze Estimation

S. Park, S. De Mello, P. Molchanov, U. Iqbal, O. Hilliges, J. Kautz

IEEE International Conference on Computer Vision (ICCV)

October 2019 (oral)

ICCV

Learning Propagation for Arbitrarily-structured Data

S. Liu, X. Li, V. Jampani, S. De Mello, J. Kautz

IEEE International Conference on Computer Vision (ICCV)

October 2019

ICCV

Few-shot Unsupervised Image-to-Image Translation

M.-Y. Liu, X. Huang, A. Mallya, T. Karras, T. Aila, J. Lehtinen, J. Kautz

IEEE International Conference on Computer Vision (ICCV)

October 2019

ICCV

Neural Inverse Rendering of an Indoor Scene from a Single Image

S. Sengupta, J. Gu, K. Kim, G. Liu, D. Jacobs, J. Kautz

IEEE International Conference on Computer Vision (ICCV)

October 2019

ICCV

Unsupervised Video Interpolation Using Cycle Consistency

F. Reda, D. Sun, A. Dundar, M. Shoeybi, G. Liu, K. Shih, A. Tao, J. Kautz, B. Catanzaro

IEEE International Conference on Computer Vision (ICCV)

October 2019

BMVC

Few-Shot Viewpoint Estimation

H.-Y. Tseng, S. De Mello, J. Tremblay, S. Liu, S. Birchfield, M.-H. Yang, J. Kautz

British Machine Vision Conference (BMVC)

September 2019

BMVC

Video Stitching for Linear Camera Arrays

W.-S. Lai, O. Gallo, J. Gu, D. Sun, M.-H. Yang, J. Kautz

British Machine Vision Conference (BMVC)

September 2019

CVPR

Joint Discriminative and Generative Learning for Person Re-identification

Z. Zheng, X. Yang, Z. Yu, L. Zheng, Y. Yang, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2019 (oral)

CVPR

STEP: Spatio-Temporal Progressive Learning for Video Action Detection

X. Yang, X. Yang, M.-Y. Liu, F. Xiao, L. Davis, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2019 (oral)

CVPR

PlaneRCNN: 3D Plane Detection and Reconstruction from a Single View

C. Liu, K. Kim, J. Gu, Y. Furukawa, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2019 (oral)

CVPR

Neural RGB → D Sensing: Depth and Uncertainty from a Video Camera

C. Liu, J. Gu, K. Kim, S. Narasimhan, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2019 (Best Paper Finalist & oral)

CVPR

SCOPS: Self-Supervised Co-Part Segmentation

W.-C. Hung, V. Jampani, S. Liu, P. Molchanov, M.-H. Yang, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2019

CVPR

Pixel Adaptive Convolutional Neural Networks

H. Su, V. Jampani, D. Sun, O. Gallo, E. Learned-Miller, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2019

CVPR

Learning Linear Transformations for Fast Image and Video Style Transfer

X. Li, S. Liu, J. Kautz, M.-H. Yang

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2019

CVPR

Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments

X. Li, S. Liu, K. Kim, M.-H. Yang, X. Wang, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2019

CVPR

Importance Estimation for Neural Network Pruning

P. Molchanov, A. Mallya, S. Tyree, I. Frosio, J. Kautz

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

June 2019

TPAMI

Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation

D. Sun, X. Yang, M.-Y. Liu, J. Kautz

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

?(?), ? 2019

TIP

Statistical Nearest Neighbors for Image Denoising

I. Frosio, J. Kautz

IEEE Transactions on Image Processing

28(2), February 2019, pages 723-728

WACV

A Fusion Approach for Multi-Frame Optical Flow Estimation

Z. Ren, O. Gallo, D. Sun, M.-H. Yang, E. Sudderth, J. Kautz

IEEE Winter Conference on Applications of Computer Vision (WACV)

January 2019