Awesome
<!-- # <p align=center>`awesome gan-inversion`</p> --> <!-- [![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome) [![Maintenance](https://img.shields.io/badge/Maintained%3F-yes-green.svg)](https://GitHub.com/Naereen/StrapDown.js/graphs/commit-activity) [![PR's Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen.svg?style=flat)](http://makeapullrequest.com) --> <!-- ![visitors](https://visitor-badge.glitch.me/badge?style=flat-square&page_id=weihaox/awesome-gan-inversion) --> <!-- <br/> --> <p align="center"> <h1 align="center">GAN Inversion: A Survey</h1> <p align="center"> TPAMI 2022 <br /> <a href="https://weihaox.github.io/"><strong>Weihao Xia</strong></a> · <a href="https://yulunzhang.com/"><strong>Yulun Zhang</strong></a> · <a href="https://sites.google.com/view/iigroup-thu/about"><strong>Yujiu Yang</strong></a> · <a href="http://www.homepages.ucl.ac.uk/~ucakjxu/"><strong>Jing-Hao Xue</strong></a> · <a href="https://boleizhou.github.io/"><strong>Bolei Zhou</strong></a> · <a href="https://faculty.ucmerced.edu/mhyang/"><strong>Ming-Hsuan Yang</strong></a> </p> <p align="center"> <a href='https://arxiv.org/abs/2101.05278'> <img src='https://img.shields.io/badge/Paper-PDF-green?style=flat&logo=arxiv&logoColor=green' alt='arxiv PDF'> </a> <a href='https://github.com/weihaox/awesome-gan-inversion' style='padding-left: 0.5rem;'> <img src='https://img.shields.io/badge/Project-Page-blue?style=flat&logo=Google%20chrome&logoColor=blue' alt='Project Page'> </a> <a href='https://ieeexplore.ieee.org/document/9792208' style='padding-left: 0.5rem;'> <img src='https://img.shields.io/badge/TPAMI-PDF-red?style=flat&logoColor=red' alt='TPAMI PDF'> </a> </p> </p> <br />This repo is a collection of resources on GAN inversion, as a supplement for our survey. If you find any work missing or have any suggestions (papers, implementations and other resources), feel free to pull requests. You could manually edit items or use the script to produce them in the markdown format.
<details style="margin-left:3%;"> <summary>citation</summary> <pre><code class="language-bib" style="font-size: 0.9rem;" id="citation">@article{xia2022gan, author = {Xia, Weihao and Zhang, Yulun and Yang, Yujiu and Xue, Jing-Hao and Zhou, Bolei and Yang, Ming-Hsuan}, title = {GAN Inversion: A Survey}, journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)}, year={2022} } </code></pre> </details> <details><summary>Table of Contents</summary><p>- Inverted Pretrained Models
- GAN Inversion Methods
- Diffusion Inversion
- GAN Latent Space Editing
- Diffusion Latent Space Editing
- Applications
- Acknowledgement
Inverted Pretrained Models
2D GANs
Scaling up GANs for Text-to-Image Synthesis.<br> Minguk Kang, Jun-Yan Zhu, Richard Zhang, Jaesik Park, Eli Shechtman, Sylvain Paris, Taesung Park.<br> CVPR 2023 (Highlight). [PDF] [Project]
StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis.<br> Axel Sauer, Tero Karras, Samuli Laine, Andreas Geiger, Timo Aila.<br> ICML 2023. [Project] [PDF] [Code]
StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets.<br> Axel Sauer, Katja Schwarz, Andreas Geiger.<br> SIGGRAPH 2022. [PDF] [Project] [Code]
Self-Distilled StyleGAN: Towards Generation from Internet Photos.<br> Ron Mokady, Michal Yarom, Omer Tov, Oran Lang, Daniel Cohen-Or, Tali Dekel, Michal Irani, Inbar Mosseri.<br> SIGGRAPH 2022. [PDF] [Project] [Code]
Ensembling Off-the-shelf Models for GAN Training.<br> Nupur Kumari, Richard Zhang, Eli Shechtman, Jun-Yan Zhu<br> CVPR 2022. [PDF] [Project] [Code]
StyleGAN3: Alias-Free Generative Adversarial Networks.<br> Tero Karras, Miika Aittala, Samuli Laine, Erik Härkönen, Janne Hellsten, Jaakko Lehtinen, Timo Aila.<br> NeurIPS 2021. [PDF] [Project] [Code] [Rosinality]
StyleGAN2-Ada: Training Generative Adversarial Networks with Limited Data.<br> Tero Karras, Miika Aittala, Janne Hellsten, Samuli Laine, Jaakko Lehtinen, Timo Aila.<br> NeurIPS 2020. [PDF] [Code] [Steam StyleGAN2-ADA]
StyleGAN2: Analyzing and Improving the Image Quality of StyleGAN.<br> Tero Karras, Samuli Laine, Miika Aittala, Janne Hellsten, Jaakko Lehtinen, Timo Aila.<br> CVPR 2020. [PDF] [PyTorch] [Offical TF] [Unoffical Tensorflow 2.0]
StyleGAN: A Style-Based Generator Architecture for Generative Adversarial Networks.<br> Tero Karras, Samuli Laine, Timo Aila.<br> CVPR 2019. [PDF] [Offical TF]
ProGAN: Progressive Growing of GANs for Improved Quality, Stability, and Variation.<br> Tero Karras, Timo Aila, Samuli Laine, Jaakko Lehtinen.<br> ICLR 2018. [PDF] [Offical TF]
3D-aware GANs
Please check our 3D-aware image synthesis survey, paper list, and project for more details.
EG3D: Efficient Geometry-aware 3D Generative Adversarial Networks.<br> Eric R. Chan, Connor Z. Lin, Matthew A. Chan, Koki Nagano, Boxiao Pan, Shalini De Mello, Orazio Gallo, Leonidas Guibas, Jonathan Tremblay, Sameh Khamis, Tero Karras, Gordon Wetzstein.<br> CVPR 2022. [PDF] [Project] [Code]
StyleSDF: High-Resolution 3D-Consistent Image and Geometry Generation.<br> Roy Or-El, Xuan Luo, Mengyi Shan, Eli Shechtman, Jeong Joon Park, Ira Kemelmacher-Shlizerman.<br> CVPR 2022. [PDF] [Project] [Code]
StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis.<br> Jiatao Gu, Lingjie Liu, Peng Wang, Christian Theobalt.<br> ICLR 2022. [PDF] [Project]
pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis.<br> Eric R. Chan, Marco Monteiro, Petr Kellnhofer, Jiajun Wu, Gordon Wetzstein.<br> CVPR 2021. [PDF] [Project] [Code]
GAN Inversion Methods
The section primarily encompasses general-purpose 2D or 3D inversion techniques, whereas the methods presented in the following section cater to particular applications.
3D GAN Inversion
TriPlaneNet: An Encoder for EG3D Inversion.<br> Ananta R. Bhattarai, Matthias Nießner, Artem Sevastopolsky.<br> WACV 2024. [PDF] [Project]
In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing.<br> Yiran Xu, Zhixin Shu, Cameron Smith, Jia-Bin Huang, Seoung Wug Oh.<br> CVPR 2024. [PDF] [Project]
Make Encoder Great Again in 3D GAN Inversion through Geometry and Occlusion-Aware Encoding.<br> Ziyang Yuan, Yiming Zhu, Yu Li, Hongyu Liu, Chun Yuan.<br> ICCV 2023. [PDF] [Project] [Code]
LatentSwap3D: Semantic Edits on 3D Image GANs.<br> Enis Simsar, Alessio Tonioni, Evin Pınar Örnek, Federico Tombari.<br> ICCV 2023 Workshops on AI3DCC. [PDF] [Code]
High-fidelity 3D GAN Inversion by Pseudo-multi-view Optimization.<br> Jiaxin Xie, Hao Ouyang, Jingtan Piao, Chenyang Lei, Qifeng Chen.<br> CVPR 2023. [PDF] [Project] [Code]
Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis from Monocular Image.<br> Yu Deng, Baoyuan Wang, Heung-Yeung Shum.<br> CVPR 2023. [PDF] [Project]
E3DGE: Self-Supervised Geometry-Aware Encoder for Style-based 3D GAN Inversion.<br> Yushi Lan, Xuyi Meng, Shuai Yang, Chen Change Loy, Bo Dai.<br> CVPR 2023. [PDF] [Project] [Code]
3D GAN Inversion with Pose Optimization.<br> Jaehoon Ko, Kyusun Cho, Daewon Choi, Kwangrok Ryoo, Seungryong Kim.<br> WACV 2023. [PDF] [Project] [Code]
3D GAN Inversion for Controllable Portrait Image Animation.<br> Connor Z. Lin, David B. Lindell, Eric R. Chan, Gordon Wetzstein.<br> ECCV 2022 Workshop on Learn3DG. [PDF] [Project]
Pix2NeRF: Unsupervised Conditional π-GAN for Single Image to Neural Radiance Fields Translation.<br> Shengqu Cai, Anton Obukhov, Dengxin Dai, Luc Van Gool.<br> CVPR 2022. [PDF]
2D GAN Inversion
StyleRes: Transforming the Residuals for Real Image Editing with StyleGAN.<br> Hamza Pehlivan, Yusuf Dalva, Aysegul Dundar.<br> CVPR 2023. [PDF] [Code]
ReGANIE: Rectifying GAN Inversion Errors for Accurate Real Image Editing.<br> Bingchuan Li, Tianxiang Ma, Peng Zhang, Miao Hua, Wei Liu, Qian He, Zili Yi.<br> AAAI 2023. [PDF]
Intra-Source Style Augmentation for Improved Domain Generalization.<br> Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva.<br> WACV 2023. [PDF] [Code]
Pivotal Tuning for Latent-based Editing of Real Images.<br> Daniel Roich, Ron Mokady, Amit H. Bermano, Daniel Cohen-Or.<br> TOG 2022. [PDF] [Code]
E2Style: Improve the Efficiency and Effectiveness of StyleGAN Inversion.<br> Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Weiming Zhang, Lu Yuan, Gang Hua, Nenghai Yu.<br> TIP 2022. [PDF] [Project] [Code]
High-fidelity GAN Inversion with Padding Space.<br> Qingyan Bai, Yinghao Xu, Jiapeng Zhu, Weihao Xia, Yujiu Yang, Yujun Shen.<br> ECCV 2022. [PDF] [Project] [Code]
Editing Out-of-Domain GAN Inversion via Differential Activations.<br> Haorui Song, Yong Du, Tianyi Xiang, Junyu Dong, Jing Qin, Shengfeng He.<br> ECCV 2022. [PDF] [Code]
IntereStyle: Encoding an Interest Region for Robust StyleGAN Inversion.<br> Seungjun Moon, GyeongMoon Park.<br> ECCV 2022. [PDF]
Chunkmogrify: Real image inversion via Segments.<br> David Futschik, Michal Lukáč, Eli Shechtman, Daniel Sýkora.<br> ECCV 2022. [PDF] [Code]
A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos.<br> Xu Yao, Alasdair Newson, Yann Gousseau, Pierre Hellier.<br> ECCV 2022. [PDF] [Code]
Cycle Encoding of a StyleGAN Encoder for Improved Reconstruction and Editability.<br> Xudong Mao, Liujuan Cao, Aurele Tohokantche Gnanha, Zhenguo Yang, Qing Li, Rongrong Ji.<br> ACM MM 2022. [PDF] [Code]
Spatially-Adaptive Multilayer Selection for GAN Inversion and Editing.<br> Gaurav Parmar, Yijun Li, Jingwan Lu, Richard Zhang, Jun-Yan Zhu, Krishna Kumar Singh.<br> CVPR 2022. [PDF] [Project] [Code]
Style Transformer for Image Inversion and Editing.<br> Xueqi Hu, Qiusheng Huang, Zhengyi Shi, Siyuan Li, Changxin Gao, Li Sun, Qingli Li.<br> CVPR 2022. [PDF] [Code]
High-Fidelity GAN Inversion for Image Attribute Editing.<br> Tengfei Wang, Yong Zhang, Yanbo Fan, Jue Wang, Qifeng Chen.<br> CVPR 2022. [PDF] [Project] [Code]
HyperInverter: Improving StyleGAN Inversion via Hypernetwork.<br> Tan M. Dinh, Anh Tuan Tran, Rang Nguyen, Binh-Son Hua.<br> CVPR 2022. [PDF] [Project]
HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing.<br> Yuval Alaluf, Omer Tov, Ron Mokady, Rinon Gal, Amit H. Bermano.<br> CVPR 2022. [PDF] [Project] [Code]
Overparameterization Improves StyleGAN Inversion.<br> Yohan Poirier-Ginter, Alexandre Lessard, Ryan Smith, Jean-François Lalonde.<br> CVPR 2022 Workshop on AI for Content Creation. [PDF] [Code]
StyleAlign: Analysis and Applications of Aligned StyleGAN Models.<br> Zongze Wu, Yotam Nitzan, Eli Shechtman, Dani Lischinski.<br> ICLR 2022. [PDF]
GAN-Control: Explicitly Controllable GANs.<br> Alon Shoshan, Nadav Bhonker, Igor Kviatkovsky, Gerard Medioni.<br> ICCV 2021. [PDF]
From Continuity to Editability: Inverting GANs with Consecutive Images.<br> Yangyang Xu, Yong Du, Wenpeng Xiao, Xuemiao Xu and Shengfeng He.<br> ICCV 2021. [PDF] [Code]
Explaining in Style: Training a GAN to explain a classifier in StyleSpace.<br> Oran Lang, Yossi Gandelsman, Michal Yarom, Yoav Wald, Gal Elidan, Avinatan Hassidim, William T. Freeman, Phillip Isola, Amir Globerson, Michal Irani, Inbar Mosseri.<br> ICCV 2021. [PDF] [Project]
BDInvert: GAN Inversion for Out-of-Range Images with Geometric Transformations.<br> Kyoungkook Kang, Seongtae Kim, Sunghyun Cho.<br> ICCV 2021. [PDF] [Project]
ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement.<br> Yuval Alaluf, Or Patashnik, Daniel Cohen-Or.<br> ICCV 2021. [PDF] [Project] [Code]
LatentCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions.<br> Oğuz Kaan Yüksel, Enis Simsar, Ezgi Gülperi Er, Pinar Yanardag.<br> ICCV 2021. [PDF] [Code]
Lifting 2D StyleGAN for 3D-Aware Face Generation.<br> Yichun Shi, Divyansh Aggarwal, Anil K. Jain.<br> CVPR 2021. [PDF]
Ensembling with Deep Generative Views.<br> Lucy Chai, Jun-Yan Zhu, Eli Shechtman, Phillip Isola, Richard Zhang.<br> CVPR 2021. [PDF] [Code] [Project]
Navigating the GAN Parameter Space for Semantic Image Editing.<br> Anton Cherepkov, Andrey Voynov, Artem Babenko.<br> CVPR 2021. [PDF] [Code]
StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation.<br> Zongze Wu, Dani Lischinski, Eli Shechtman.<br> CVPR 2021 (oral). [PDF] [Code]
Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation.<br> Elad Richardson, Yuval Alaluf, Or Patashnik, Yotam Nitzan, Yaniv Azar, Stav Shapiro, Daniel Cohen-Or.<br> CVPR 2021. [PDF] [Code] [Project]
GHFeat: Generative Hierarchical Features from Synthesizing Images.<br> Yinghao Xu, Yujun Shen, Jiapeng Zhu, Ceyuan Yang, Bolei Zhou.<br> CVPR 2021. [PDF] [Code] [Project]
Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs.<br> Hui-Po Wang, Ning Yu, Mario Fritz.<br> CVPR 2021. [PDF]
Prior Image-Constrained Reconstruction using Style-Based Generative Models.<br> Varun A Kelkar, Mark Anastasio.<br> ICML 2021. [PDF]
Intermediate Layer Optimization for Inverse Problems using Deep Generative Models.<br> Giannis Daras, Joseph Dean, Ajil Jalal, Alexandros G. Dimakis.<br> ICML 2021. [PDF] [Code]
Using Latent Space Regression to Analyze and Leverage Compositionality in GANs.<br> Lucy Chai, Jonas Wulff, Phillip Isola.<br> ICLR 2021. [PDF] [Code] [Project] [Colab]
Enjoy Your Editing: Controllable GANs for Image Editing via Latent Space Navigation.<br> Peiye Zhuang, Oluwasanmi Koyejo, Alexander G. Schwing.<br> ICLR 2021. [PDF]
Disentangled Face Attribute Editing via Instance-Aware Latent Space Search.<br> Yuxuan Han, Jiaolong Yang, Ying Fu.<br> IJCAI 2021. [PDF] [Code]
High Fidelity GAN Inversion via Prior Multi-Subspace Feature Composition.<br> Guanyue Li, Qianfen Jiao, Sheng Qian, Si Wu, Hau-San Wong.<br> AAAI 2021. [PDF]
e4e: Designing an Encoder for StyleGAN Image Manipulation.<br> Omer Tov, Yuval Alaluf, Yotam Nitzan, Or Patashnik, Daniel Cohen-Or.<br> TOG 2021. [PDF] [Code]
StyleFlow: Attribute-conditioned Exploration of StyleGAN-Generated Images using Conditional Continuous Normalizing Flows.<br> Rameen Abdal, Peihao Zhu, Niloy Mitra, Peter Wonka.<br> TOG 2021. [PDF] [Code]
PIE: Portrait Image Embedding for Semantic Control.<br> A. Tewari, M. Elgharib, M. BR, F. Bernard, H-P. Seidel, P. Pérez, M. Zollhöfer, C.Theobalt.<br> TOG 2020. [PDF] [Project]
Face Identity Disentanglement via Latent Space Mapping.<br> Yotam Nitzan, Amit Bermano, Yangyan Li, Daniel Cohen-Or.<br> SIGGRAPH Asia 2020. [PDF] [Project] [Code]
Understanding the Role of Individual Units in a Deep Neural Network.<br> David Bau, Jun-Yan Zhu, Hendrik Strobelt, Agata Lapedriza, Bolei Zhou, Antonio Torralba.<br> National Academy of Sciences 2020. [PDF] [Code] [Project]
Face Identity Disentanglement via Latent Space Mapping.<br> Yotam Nitzan, Amit Bermano, Yangyan Li, Daniel Cohen-Or.<br> TOG 2020. [PDF] [Code]
Transforming and Projecting Images into Class-conditional Generative Networks.<br> Minyoung Huh, Richard Zhang, Jun-Yan Zhu, Sylvain Paris, Aaron Hertzmann.<br> ECCV 2020. [PDF] [Code] [Project]
MimicGAN: Robust Projection onto Image Manifolds with Corruption Mimicking.<br> Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Timo Bremer.<br> IJCV 2020. [PDF]
Rewriting a Deep Generative Model.<br> David Bau, Steven Liu, Tongzhou Wang, Jun-Yan Zhu, Antonio Torralba.<br> ECCV 2020. [PDF] [Code]
StyleGAN2 Distillation for Feed-forward Image Manipulation.<br> Yuri Viazovetskyi, Vladimir Ivashkin, Evgeny Kashin.<br> ECCV 2020. [PDF] [Code]
In-Domain GAN Inversion for Real Image Editing.<br> Jiapeng Zhu, Yujun Shen, Deli Zhao, Bolei Zhou.<br> ECCV 2020. [PDF] [Project] [Code]
Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation.<br> Xingang Pan, Xiaohang Zhan, Bo Dai, Dahua Lin, Chen Change Loy, Ping Luo.<br> ECCV 2020. [PDF] [Code]
Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models.<br> Giannis Daras, Augustus Odena, Han Zhang, Alexandros G. Dimakis.<br> CVPR 2020. [PDF]
A Disentangling Invertible Interpretation Network for Explaining Latent Representations.<br> Patrick Esser, Robin Rombach, Björn Ommer.<br> CVPR 2020. [PDF] [Project] [Code]
Editing in Style: Uncovering the Local Semantics of GANs.<br> Edo Collins, Raja Bala, Bob Price, Sabine Süsstrunk.<br> CVPR 2020. [PDF] [Code]
Image Processing Using Multi-Code GAN Prior.<br> Jinjin Gu, Yujun Shen, Bolei Zhou.<br> CVPR 2020. [PDF] [Project] [Code]
Image2StyleGAN++: How to Edit the Embedded Images?<br> Rameen Abdal, Yipeng Qin, Peter Wonka.<br> CVPR 2020. [PDF]
Semantic Photo Manipulation with a Generative Image Prior.<br> David Bau, Hendrik Strobelt, William Peebles, Jonas, Bolei Zhou, Jun-Yan Zhu, Antonio Torralba.<br> TOG 2019. [PDF]
Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?<br> Rameen Abdal, Yipeng Qin, Peter Wonka.<br> ICCV 2019. [PDF] [Code]
GAN-based Projector for Faster Recovery with Convergence Guarantees in Linear Inverse Problems.<br> Ankit Raj, Yuqi Li, Yoram Bresler.<br> ICCV 2019. [PDF]
Inverting Layers of a Large Generator.<br> David Bau, Jun-Yan Zhu, Jonas Wulff, William Peebles, Hendrik Strobelt, Bolei Zhou, Antonio Torralba.<br> ICCV 2019. [PDF]
Detecting Overfitting in Deep Generators via Latent Recovery.<br> Ryan Webster, Julien Rabin, Loic Simon, Frederic Jurie.<br> CVPR 2019. [PDF][Colab]
Inverting The Generator Of A Generative Adversarial Network (II).<br> Antonia Creswell, Anil A Bharath.<br> TNNLS 2018. [PDF] [Code]
Invertibility of Convolutional Generative Networks from Partial Measurements.<br> Fangchang Ma, Ulas Ayaz, Sertac Karaman.<br> NeurIPS 2018. [PDF] [Code]
Metrics for Deep Generative Models.<br> Nutan Chen, Alexej Klushyn, Richard Kurle, Xueyan Jiang, Justin Bayer, Patrick van der Smagt.<br> AISTATS 2018. [PDF]
Towards Understanding the Invertibility of Convolutional Neural Networks.<br> Anna C. Gilbert, Yi Zhang, Kibok Lee, Yuting Zhang, Honglak Lee.<br> IJCAI 2017. [PDF]
One Network to Solve Them All - Solving Linear Inverse Problems using Deep Projection Models.<br> J. H. Rick Chang, Chun-Liang Li, Barnabas Poczos, B. V. K. Vijaya Kumar, Aswin C. Sankaranarayanan.<br> ICCV 2017. [PDF]
Precise Recovery of Latent Vectors from Generative Adversarial Networks.<br> Zachary C. Lipton, Subarna Tripathi.<br> ICLR 2017 workshop. [PDF] [Code]
Inverting The Generator Of A Generative Adversarial Network.<br> Antonia Creswell, Anil Anthony Bharath.<br> NeurIPS 2016 Workshop. [PDF]
Generative Visual Manipulation on the Natural Image Manifold.<br> Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, Alexei A. Efros.<br> ECCV 2016. [PDF]
Improved StyleGAN Embedding: Where Are The Good Latents?.<br> Peihao Zhu, Rameen Abdal, Yipeng Qin, John Femiani, Peter Wonka.<br> arxiv 2020. [PDF] [Code]
Improving Inversion and Generation Diversity in StyleGAN Using A Gaussianized Latent Space.<br> Jonas Wulff, Antonio Torralba.<br> arxiv 2020. [PDF]
<p width="100%" align="right"><a href="#">🔝</a></p>GAN Latent Space Editing
Inversion isn't the ultimate goal; it's a means to enable real image or video editing within a GAN's latent space. Commonly referred to as GAN latent space editing, navigation, traversal, steerability, or other names in the literature, this task, although sometimes seen as a standalone research domain, acts as an indispensable component of GAN inversion based editing. This section is about GAN latent space editing. Recent studies reveal diffusion model can be used for real image and video editing in a similar way. Please refer to Diffusion Inversion and Diffusion Latent Space Editing for more details.
Deep Curvilinear Editing: Commutative and Nonlinear Image Manipulation for Pretrained Deep Generative Model.<br> Takehiro Aoshima, Takashi Matsubara.<br> CVPR 2023. [PDF]
Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs.<br> Enis Simsar, Umut Kocasari, Ezgi Gülperi Er, Pinar Yanardag.<br> WACV 2023. [PDF] [Project] [Demo]
Latent Traversals in Generative Models as Potential Flows.<br> Yue Song, Andy Keller, Nicu Sebe, Max Welling.<br> ICML 2023. [PDF] [Code]
PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs.<br> James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras.<br> ICLR 2023. [PDF] [Project] [Code]
Rayleigh EigenDirections (REDs): GAN Latent Space Traversals for Multidimensional Features.<br> Guha Balakrishnan, Raghudeep Gadde, Aleix Martinez, Pietro Perona.<br> ECCV 2022. [PDF]
Rayleigh EigenDirections (REDs): Nonlinear GAN Latent Space Traversals for Multidimensional Features.<br> Guha Balakrishnan, Raghudeep Gadde, Aleix Martinez, Pietro Perona.<br> ECCV 2022. [PDF]
Exploring Gradient-based Multi-directional Controls in GANs.<br> Zikun Chen, Ruowei Jiang, Brendan Duke, Han Zhao, Parham Aarabi.<br> ECCV 2022 (oral). [PDF] [Project]
Hierarchical Semantic Regularization of Latent Spaces in StyleGANs.<br> Tejan Karmali, Rishubh Parihar, Susmit Agrawal, Harsh Rangwani, Varun Jampani, Maneesh Singh, R. Venkatesh Babu.<br> ECCV 2022. [PDF] [Project]
CLIP2StyleGAN: Unsupervised Extraction of StyleGAN Edit Directions.<br> Rameen Abdal, Peihao Zhu, John Femiani, Niloy J. Mitra, Peter Wonka.<br> SIGGRAPH 2022. [PDF] [Code]
Region-Based Semantic Factorization in GANs.<br> Jiapeng Zhu, Yujun Shen, Yinghao Xu, Deli Zhao, Qifeng Chen.<br> ICML 2022. [PDF] [Code]
Latent Image Animator: Learning to Animate Image via Latent Space Navigation.<br> Yaohui Wang, Di Yang, Francois Bremond, Antitza Dantcheva.<br> ICLR 2022. [PDF] [Project] [Code]
Do Not Escape From the Manifold: Discovering the Local Coordinates on the Latent Space of GANs.<br> Jaewoong Choi, Changyeon Yoon, Junho Lee, Jung Ho Park, Geonho Hwang, Myungjoo Kang.<br> ICLR 2022. [PDF]
Tensor-based Emotion Editing in the StyleGAN Latent Space.<br> René Haas, Stella Graßhof, Sami S. Brandt.<br> CVPR 2022 Workshop on AI for Content Creation. [PDF]
PaintInStyle: One-Shot Discovery of Interpretable Directions by Painting.<br> Berkay Doner, Elif Sema Balcioglu, Merve Rabia Barin, Umut Kocasari, Mert Tiftikci, Pinar Yanardag.<br> CVPR 2022 Workshop. [PDF]
Rank in Style: A Ranking-Based Approach To Find Interpretable Directions.<br> Umut Kocasari, Kerem Zaman, Mert Tiftikci, Enis Simsar, Pinar Yanardag.<br> CVPR 2022 Workshop. [PDF]
LARGE: Latent-Based Regression through GAN Semantics.<br> Yotam Nitzan, Rinon Gal, Ofir Brenner, Daniel Cohen-Or.<br> CVPR 2022. [PDF] [Code] [Project]
StyleFusion: Disentangling Spatial Segments in StyleGAN-Generated Images.<br> Omer Kafri, Or Patashnik, Yuval Alaluf, Daniel Cohen-Or.<br> TOG 2022. [PDF] [Code]
Optimizing Latent Space Directions For GAN-based Local Image Editing.<br> Ehsan Pajouheshgar, Tong Zhang, Sabine Süsstrunk.<br> ICASSP 2022. [PDF]
Tensor Component Analysis for Interpreting the Latent Space of GANs.<br> James Oldfield, Markos Georgopoulos, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras.<br> BMVC 2021. [PDF] [Project] [Code]
Tensor-based Subspace Factorization for StyleGAN.<br> Rene Haas, Stella Graßhof and Sami S. Brandt.<br> FG 2021. [PDF]
Exploratory Search of GANs with Contextual Bandits.<br> Ivan Kropotov, Alan Medlar, Dorota Glowacka.<br> CIKM 2021. [PDF]
LowRankGAN: Low-Rank Subspaces in GANs.<br> Jiapeng Zhu, Ruili Feng, Yujun Shen, Deli Zhao, Zhengjun Zha, Jingren Zhou, Qifeng Chen.<br> NeurIPS 2021. [PDF] [Code]
Controllable and Compositional Generation with Latent-Space Energy-Based Models.<br> Weili Nie, Arash Vahdat, Anima Anandkumar.<br> NeurIPS 2021. [PDF]
A Latent Transformer for Disentangled Face Editing in Images and Videos.<br> Xu Yao, Alasdair Newson, Yann Gousseau, Pierre Hellier.<br> ICCV 2021. [PDF] [arxiv] [Code]
Toward a Visual Concept Vocabulary for GAN Latent Space.<br> Sarah Schwettmann, Evan Hernandez, David Bau, Samuel Klein, Jacob Andreas, Antonio Torralba.<br> ICCV 2021. [PDF] [Project]
WarpedGANSpace: Finding Non-linear RBF Paths in GAN Latent Space.<br> Christos Tzelepis, Georgios Tzimiropoulos, Ioannis Patras.<br> ICCV 2021. [PDF] [Code]
Latent Transformations via NeuralODEs for GAN-based Image Editing.<br> Valentin Khrulkov, Leyla Mirvakhabova, Ivan Oseledets, Artem Babenko.<br> ICCV 2021. [PDF] [Code]
OroJaR: Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation.<br> Yuxiang Wei, Yupeng Shi, Xiao Liu, Zhilong Ji, Yuan Gao, Zhongqin Wu, Wangmeng Zuo.<br> ICCV 2021. [PDF] [Code]
EigenGAN: Layer-Wise Eigen-Learning for GANs.<br> Zhenliang He, Meina Kan, Shiguang Shan.<br> ICCV 2021. [PDF] [Code]
SSFlow: Style-guided Neural Spline Flows for Face Image Manipulation.<br> Hanbang Liang, Xianxu Hou, Linlin Shen.<br> ACM MM 2021. [PDF]
SalS-GAN: Spatially-Adaptive Latent Space in StyleGAN for Real Image Embedding.<br> Lingyun Zhang, Xiuxiu Bai, Yao Gao.<br> ACM MM 2021. [PDF]
Discovering Density-Preserving Latent Space Walks in GANs for Semantic Image Transformations.<br> Guanyue Li, Yi Liu, Xiwen Wei, Yang Zhang, Si Wu, Yong Xu, Hau San Wong.<br> ACM MM 2021. [PDF]
Discovering Interpretable Latent Space Directions of GANs Beyond Binary Attributes.<br> Huiting Yang, Liangyu Chai, Qiang Wen, Shuang Zhao, Zixun Sun, Shengfeng He.<br> CVPR 2021. [PDF] [Code]
Surrogate Gradient Field for Latent Space Manipulation.<br> Minjun Li, Yanghua Jin, Huachun Zhu.<br> CVPR 2021. [PDF]
SeFa: Closed-Form Factorization of Latent Semantics in GANs.<br> Yujun Shen, Bolei Zhou.<br> CVPR 2021. [PDF] [Code] [Project]
L2M-GAN: Learning To Manipulate Latent Space Semantics for Facial Attribute Editing.<br> Guoxing Yang, Nanyi Fei, Mingyu Ding, Guangzhen Liu, Zhiwu Lu, Tao Xiang.<br> CVPR 2021. [PDF] [Unofficial Pytorch]
MoCoGAN-HD: A Good Image Generator Is What You Need for High-Resolution Video Synthesis.<br> Yu Tian, Jian Ren, Menglei Chai, Kyle Olszewski, Xi Peng, Dimitris N. Metaxas, Sergey Tulyakov.<br> ICLR 2021. [PDF] [Code]
GAN Steerability without optimization.<br> Nurit Spingarn-Eliezer, Ron Banner, Tomer Michaeli.<br> ICLR 2021. [OpenReview] [PDF]
On the "steerability" of generative adversarial networks.<br> Ali Jahanian, Lucy Chai, Phillip Isola.<br> ICLR 2020. [PDF] [Project]
GANSpace: Discovering Interpretable GAN Controls.<br> Erik Härkönen, Aaron Hertzmann, Jaakko Lehtinen, Sylvain Paris.<br> NeurIPS 2020. [PDF] [Code]
Interpreting the Latent Space of GANs for Semantic Face Editing.<br> Yujun Shen, Jinjin Gu, Xiaoou Tang, Bolei Zhou.<br> CVPR 2020. [PDF] [Project] [Code]
Seeing What a GAN Cannot Generate.<br> David Bau, Jun-Yan Zhu, Jonas Wulff, William Peebles, Hendrik Strobelt, Bolei Zhou, Antonio Torralba.<br> ICCV 2019. [PDF] [PDF]
Unsupervised Discovery of Interpretable Directions in the GAN Latent Space.<br> Andrey Voynov, Artem Babenko.<br> ICML 2020. [PDF] [Code]
Multi-level Latent Space Structuring for Generative Control.<br> Oren Katzir, Vicky Perepelook, Dani Lischinski, Daniel Cohen-Or.<br> arxiv 2022. [PDF]
<p width="100%" align="right"><a href="#">🔝</a></p>Diffusion Inversion
An Edit Friendly DDPM Noise Space: Inversion and Manipulations.<br> Inbar Huberman-Spiegelglas, Vladimir Kulikov, Tomer Michaeli.<br> CVPR 2024. [PDF] [Project] [Code]
Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code.<br> Xuan Ju, Ailing Zeng, Yuxuan Bian, Shaoteng Liu, Qiang Xu.<br> ICLR 2024. [PDF] [Code]
NULL-text Inversion for Editing Real Images using Guided Diffusion Models.<br> Ron Mokady, Amir Hertz, Kfir Aberman, Yael Pritch, Daniel Cohen-Or.<br> CVPR 2023. [PDF] [Project] [Code]
EDICT: Exact Diffusion Inversion via Coupled Transformations.<br> Bram Wallace, Akash Gokul, Nikhil Naik.<br> CVPR 2023. [PDF] [Code]
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion.<br> Rinon Gal, Yuval Alaluf, Yuval Atzmon, Or Patashnik, Amit H. Bermano, Gal Chechik, Daniel Cohen-Or.<br> ICLR 2023 (Oral). [PDF] [Project] [Code]
Prompt-to-Prompt Image Editing with Cross Attention Control.<br> Amir Hertz, Ron Mokady, Jay Tenenbaum, Kfir Aberman, Yael Pritch, Daniel Cohen-Or.<br> ICLR 2023. [PDF] [Project] [Code]
A Neural Space-Time Representation for Text-to-Image Personalization.<br> Yuval Alaluf, Elad Richardson, Gal Metzer, Daniel Cohen-Or.<br> SIGGRAPH Asia 2023. [PDF] [Project]
Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models.<br> René Haas, Inbar Huberman-Spiegelglas, Rotem Mulayoff, and Tomer Michaeli.<br> arxiv 2023. [PDF]
Diffusion Latent Space Editing
Semantic Editing in Diffusion Latent Spaces.
Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation.<br> Hang Li, Chengzhi Shen, Philip Torr, Volker Tresp, Jindong Gu.<br> CVPR 2024. [PDF] [Project] [Code]
NoiseCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models.<br> Yusuf Dalva, Pinar Yanardag.<br> CVPR 2024 (Oral). [PDF] [Project]
An Edit Friendly DDPM Noise Space: Inversion and Manipulations.<br> Inbar Huberman-Spiegelglas, Vladimir Kulikov, Tomer Michaeli.<br> CVPR 2024. [PDF] [Code]
Diffusion Models Already Have A Semantic Latent Space.<br> Mingi Kwon, Jaeseok Jeong, Youngjung Uh.<br> ICLR 2023. [PDF]
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance.<br> Chen Henry Wu, Fernando De la Torre.<br> ICCV 2023. [PDF] [Code]
Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models.<br> Chen Henry Wu, Saman Motamed, Shaunak Srivastava, Fernando De la Torre.<br> NeurIPS 2022. [PDF]
$P+$: Extended Textual Conditioning in Text-to-Image Generation.<br> Andrey Voynov, Qinghao Chu, Daniel Cohen-Or, Kfir Aberman.<br> arxiv 2023. [PDF] [Project]
<p width="100%" align="right"><a href="#">🔝</a></p>Applications
Image and Video Generation and Manipulation
Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation.<br> Jihyun Kim, Changjae Oh, Hoseok Do, Soohyun Kim, Kwanghoon Sohn.<br> CVPR 2024. [PDF] [Project]
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing.<br> Denis Bobkov, Vadim Titov, Aibek Alanov, Dmitry Vetrov.<br> CVPR 2024. [PDF] [Project]
Robust One-Shot Face Video Re-enactment using Hybrid Latent Spaces of StyleGAN2.<br> Trevine Oorloff, Yaser Yacoob.<br> ICCV 2023. [PDF] [Project]
Expressive Talking Head Video Encoding in StyleGAN2 Latent-Space.<br> Trevine Oorloff, Yaser Yacoob.<br> ICCVW 2023. [PDF] [Project] [Code] [Data]
CLIP-Guided StyleGAN Inversion for Text-Driven Real Image Editing.<br> Ahmet Canberk Baykal, Abdul Basit Anees, Duygu Ceylan, Erkut Erdem, Aykut Erdem, Deniz Yuret.<br> TOG 2023. [PDF]
NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-shot Real Image Animation.<br> Yu Yin, Kamran Ghasedi, HsiangTao Wu, Jiaolong Yang, Xin Tong, Yun Fu.<br> CVPR 2023. [PDF]
Fine-Grained Face Swapping via Regional GAN Inversion.<br> Zhian Liu, Maomao Li, Yong Zhang, Cairong Wang, Qi Zhang, Jue Wang, Yongwei Nie.<br> CVPR 2023. [PDF] [Project]
VIVE3D: Viewpoint-Independent Video Editing using 3D-Aware GANs.<br> Anna Frühstück, Nikolaos Sarafianos, Yuanlu Xu, Peter Wonka, Tony Tung.<br> CVPR 2023. [PDF] [Project] [Code]
Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint.<br> Hongyu Liu, Yibing Song, Qifeng Chen.<br> CVPR 2023. [PDF] [Project] [Code]
Balancing Reconstruction and Editing Quality of GAN Inversion for Real Image Editing with StyleGAN Prior Latent Space.<br> Kai Katsumata, Duc Minh Vo, Bei Liu, Hideki Nakayama.<br> CVPR 2023 Workshop on AI4CC. [PDF]
Modeling the Latent Dynamics of StyleGAN using Neural ODEs.<br> Weihao Xia, Yujiu Yang, Jing-Hao Xue.<br> NeurIPSW 2023. [PDF] [Code]
Dr.3D: Adapting 3D GANs to Artistic Drawings.<br> Wonjoon Jin, Nuri Ryu, Geonung Kim, Seung-Hwan Baek, Sunghyun Cho.<br> SIGGRAPH Asia 2022. [PDF] [Project]
Stitch it in Time: GAN-Based Facial Editing of Real Videos.<br> Rotem Tzaban, Ron Mokady, Rinon Gal, Amit H. Bermano, Daniel Cohen-Or.<br> SIGGRAPH Asia 2022. [PDF] [Project] [Code]
DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing.<br> Bingchuan Li, Shaofei Cai, Wei Liu, Peng Zhang, Miao Hua, Qian He, Zili Yi.<br> WACV 2023. [PDF] [Code]
Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models.<br> Chen Henry Wu, Saman Motamed, Shaunak Srivastava, Fernando De la Torre.<br> NeurIPS 2022. [PDF] [Code]
Generalized One-shot Domain Adaption of Generative Adversarial Networks.<br> Zicheng Zhang, Yinglu Liu, Congying Han, Tiande Guo, Ting Yao, Tao Mei.<br> NeurIPS 2022. [PDF] [Code]
3D-FM GAN: Towards 3D-Controllable Face Manipulation.<br> Yuchen Liu, Zhixin Shu, Yijun Li, Zhe Lin, Richard Zhang, and Sun-Yuan Kung.<br> ECCV 2022. [PDF] [Project]
JoJoGAN: One Shot Face Stylization.<br> Min Jin Chong, David Forsyth.<br> ECCV 2022. [PDF] [Code]
Generative Multiplane Images: Making a 2D GAN 3D-Aware.<br> Xiaoming Zhao, Fangchang Ma, David Güera, Zhile Ren, Alexander G. Schwing, Alex Colburn.<br> ECCV 2022. [PDF] [Project] [Code]
Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis.<br> Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, Donghyeon Kim, David Han, Hanseok Ko.<br> ECCV 2022. [PDF] [Project]
Temporally Consistent Semantic Video Editing.<br> Yiran Xu, Badour AlBahar, Jia-Bin Huang.<br> ECCV 2022. [PDF] [Project]
Sound-Guided Semantic Video Generation.<br> Seung Hyun Lee, Gyeongrok Oh, Wonmin Byeon, Jihyun Bae, Chanyoung Kim, Won Jeong Ryoo, Sang Ho Yoon, Jinkyu Kim, Sangpil Kim.<br> ECCV 2022. [PDF] [Project] [Code]
Third Time's the Charm? Image and Video Editing with StyleGAN3.<br> Yuval Alaluf, Or Patashnik, Zongze Wu, Asif Zamir, Eli Shechtman, Dani Lischinski, Daniel Cohen-Or.<br> ECCV 2022 Workshop on Advances in Image Manipulation. [PDF] [Project] [Code]
Everything is There in Latent Space: Attribute Editing and Attribute Style Manipulation by StyleGAN Latent Space Exploration.<br> Rishubh Parihar, Ankit Dhiman, Tejan Karmali, R. Venkatesh Babu.<br> ACM MM 2022. [PDF] [Project]
SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.<br> Zhou Kangneng, Zhu Xiaobin, Gao Daiheng, Lee Kai, Li Xinjie, Yin Xu-Cheng.<br> ACM MM 2022. [PDF]
Identity-Guided Face Generation with Multi-modal Contour Conditions.<br> Qingyan Bai, Weihao Xia, Fei Yin, Yujiu Yang.<br> ICIP 2022. [PDF]
Self-Conditioned Generative Adversarial Networks for Image Editing.<br> Yunzhe Liu, Rinon Gal, Amit H. Bermano, Baoquan Chen, Daniel Cohen-Or.<br> SIGGRAPH 2022. [PDF] [Project]
StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators.<br> Rinon Gal, Or Patashnik, Haggai Maron, Gal Chechik, Daniel Cohen-Or.<br> SIGGRAPH 2022. [PDF] [Project] [Code]
SphericGAN: Semi-Supervised Hyper-Spherical Generative Adversarial Networks for Fine-Grained Image Synthesis.<br> Tianyi Chen, Yunfei Zhang, Xiaoyang Huo, Si Wu, Yong Xu, Hau San Wong.<br> CVPR 2022. [PDF]
Sound-Guided Semantic Image Manipulation.<br> Seung Hyun Lee, Wonseok Roh, Wonmin Byeon, Sang Ho Yoon, Chan Young Kim, Jinkyu Kim, Sangpil Kim.<br> CVPR 2022. [PDF]
HairCLIP: Design Your Hair by Text and Reference Image.<br> Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Zhentao Tan, Lu Yuan, Weiming Zhang, Nenghai Yu.<br> CVPR 2022. [PDF] [Code]
HairMapper: Removing Hair from Portraits Using GANs.<br> Yiqian Wu, Yong-Liang Yang, Xiaogang Jin.<br> CVPR 2022. [PDF] [Project] [Code] [Non-hair-FFHQ Data]
Attribute Group Editing for Reliable Few-shot Image Generation.<br> Guanqi Ding, Xinzhe Han, Shuhui Wang, Shuzhe Wu, Xin Jin, Dandan Tu, Qingming Huang.<br> CVPR 2022. [PDF] [Code]
InsetGAN for Full-Body Image Generation.<br> Anna Frühstück, Krishna Kumar Singh, Eli Shechtman, Niloy J. Mitra, Peter Wonka, Jingwan Lu.<br> CVPR 2022. [PDF] [Project] [Unofficial]
SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Editing.<br> Jing Shi, Ning Xu, Haitian Zheng, Alex Smith, Jiebo Luo, Chenliang Xu.<br> CVPR 2022. [PDF]
In&Out: Diverse Image Outpainting via GAN Inversion.<br> Yen-Chi Cheng, Chieh Hubert Lin, Hsin-Ying Lee, Jian Ren, Sergey Tulyakov, Ming-Hsuan Yang.<br> CVPR 2022. [PDF] [Code]
InfinityGAN: Towards Infinite-Resolution Image Synthesis.<br> Chieh Hubert Lin, Hsin-Ying Lee, Yen-Chi Cheng, Sergey Tulyakov, Ming-Hsuan Yang.<br> ICLR 2022. [PDF] [Project]
Latent to Latent: A Learned Mapper for Identity Preserving Editing of Multiple.<br> Siavash Khodadadeh, Shabnam Ghadar, Saeid Motiian, Wei-An Lin, Ladislau Bölöni, Ratheesh Kalarot.<br> WACV 2022. [PDF]
StyleVideoGAN: A Temporal Generative Model using a Pretrained StyleGAN.<br> Gereon Fox, Ayush Tewari, Mohamed Elgharib, Christian Theobalt.<br> BMVC 2021 (Oral). [PDF]
Face Image Retrieval With Attribute Manipulation.<br> Alireza Zaeemzadeh, Shabnam Ghadar, Baldo Faieta, Zhe Lin, Nazanin Rahnavard, Mubarak Shah, Ratheesh Kalarot.<br> ICCV 2021. [PDF]
StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation.<br> Wongjong Jang, Gwangjin Ju, Yucheol Jung, Jiaolong Yang, Xin Tong, Seungyong Lee.<br> TOG 2021. [PDF] [Code]
Coarse-to-Fine: Facial Structure Editing of Portrait Images via Latent Space Classifications.<br> Yiqian Wu, Yongliang Yang, Qinjie Xiao, Xiaogang Ji.<br> TOG 2021. [PDF] [Project]
SAM: Only a Matter of Style-Age Transformation Using a Style-Based Regression Model.<br> Yuval Alaluf, Or Patashnik, Daniel Cohen-Or.<br> TOG 2021. [PDF] [Code]
Barbershop: GAN-based Image Compositing using Segmentation Masks.<br> Peihao Zhu, Rameen Abdal, John Femiani, Peter Wonka.<br> SIGGRAPH Asia 2021. [PDF] [Project] [Code]
Constrained Graphic Layout Generation via Latent Optimization.<br> Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi.<br> ACM MM 2021. [PDF] [Code]
CI-GAN: Cycle-Consistent Inverse GAN for Text-to-Image Synthesis.<br> Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao.<br> ACM MM 2021. [PDF]
Exploring Adversarial Fake Images on Face Manifold.<br> Dongze Li, Wei Wang, Hongxing Fan, Jing Dong.<br> CVPR 2021. [PDF]
HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms.<br> Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown.<br> CVPR 2021. [PDF] [Code] [4K Landscape]
One Shot Face Swapping on Megapixels.<br> Yuhao Zhu, Qi Li, Jian Wang, Chengzhong Xu, Zhenan Sun.<br> CVPR 2021. [PDF] [Code]
LOHO: Latent Optimization of Hairstyles via Orthogonalization.<br> Rohit Saha, Brendan Duke, Florian Shkurti, Graham W. Taylor, Parham Aarabi.<br> CVPR 2021. [PDF] [Code]
StyleMapGAN: Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing.<br> Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh.<br> CVPR 2021. [PDF] [Code]
TediGAN: Text-Guided Diverse Image Generation and Manipulation.<br> Weihao Xia, Yujiu Yang, Jing-Hao Xue, Baoyuan Wu.<br> CVPR 2021. [PDF] [Data] [Code]
DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs.<br> yaxing wang, Lu Yu, Joost van de Weijer.<br> NeurIPS 2020. [PDF] [Code]
DeepLandscape: Adversarial Modeling of Landscape Videos.<br> E. Logacheva, R. Suvorov, O. Khomenko, A. Mashikhin, and V. Lempitsky.<br> ECCV 2020. [PDF] [Code] [Project]
Few-shot Semantic Image Synthesis Using StyleGAN Prior.<br> Yuki Endo, Yoshihiro Kanamori.<br> arxiv 2021. [PDF] [Code]
Paint by Word.<br> David Bau, Alex Andonian, Audrey Cui, YeonHwan Park, Ali Jahanian, Aude Oliva, Antonio Torralba.<br> arxiv 2021. [PDF] [Project] [Project]
Image Restoration
LTT-GAN: Looking Through Turbulence by Inverting GANs.<br> Kangfu Mei, Vishal M. Patel.<br> J-STSP 2023. [PDF] [Project]
Semantic Uncertainty Intervals for Disentangled Latent Spaces.<br> Swami Sankaranarayanan, Anastasios N. Angelopoulos, Stephen Bates, Yaniv Romano, Phillip Isola.<br> NeurIPS 2022. [PDF] [Code]
High-Fidelity Image Inpainting with GAN Inversion.<br> Yongsheng Yu, Libo Zhang, Heng Fan, Tiejian Luo.<br> ECCV 2022. [PDF]
Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination.<br> Yiqun Mei, Pengfei Guo, Vishal M. Patel.<br> CVPR 2022. [PDF]
Towards High-Fidelity Face Self-Occlusion Recovery via Multi-View Residual-Based GAN Inversion.<br> Jinsong Chen, Hu Han, Shiguang Shan.<br> AAAI 2022. [PDF]
Time-Travel Rephotography.<br> Xuan Luo, Xuaner Zhang, Paul Yoo, Ricardo Martin-Brualla, Jason Lawrence, Steven M. Seitz.<br> SIGGRAPH Asia 2021 (TOG). [PDF] [Project] [Code]
GPEN: GAN Prior Embedded Network for Blind Face Restoration in the Wild.<br> Tao Yang, Peiran Ren, Xuansong Xie, Lei Zhang.<br> CVPR 2021. [PDF] [Code]
GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution.<br> Kelvin C.K. Chan, Xintao Wang, Xiangyu Xu, Jinwei Gu, Chen Change Loy.<br> CVPR 2021. [PDF] [Project] [Code]
GFP-GAN: Towards Real-World Blind Face Restoration with Generative Facial Prior.<br> Xintao Wang, Yu Li, Honglun Zhang, Ying Shan.<br> CVPR 2021. [PDF] [Project]
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models.<br> Sachit Menon, Alexandru Damian, Shijia Hu, Nikhil Ravi, Cynthia Rudin.<br> CVPR 2020. [PDF] [Code]
Style Generator Inversion for Image Enhancement and Animation.<br> Aviv Gabbay, Yedid Hoshen.<br> arxiv 2019. [PDF] [Project] [Code]
Image Understanding
Finding an Unsupervised Image Segmenter in each of your Deep Generative Models.<br> Luke Melas-Kyriazi, Christian Rupprecht, Iro Laina, Andrea Vedaldi.<br> ICLR 2022. [PDF]
Labels4Free: Unsupervised Segmentation using StyleGAN.<br> Rameen Abdal, Peihao Zhu, Niloy Mitra, Peter Wonka.<br> ICCV 2021. [PDF] [Project] [Code]
DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort.<br> Yuxuan Zhang, Huan Ling, Jun Gao, Kangxue Yin, Jean-Francois Lafleche, Adela Barriuso, Antonio Torralba, Sanja Fidler.<br> CVPR 2021. [PDF] [Code] [Project]
Repurposing GANs for One-shot Semantic Part Segmentation.<br> Nontawat Tritrong, Pitchaporn Rewatbowornwong, Supasorn Suwajanakorn.<br> CVPR 2021 (oral). [PDF] [Project] [Code]
Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP.<br> Daniil Pakhomov, Sanchit Hira, Narayani Wagle, Kemar E. Green, Nassir Navab.<br> arxiv 2021. [PDF] [Project] [Code]
Face Recognition
How to Boost Face Recognition with StyleGAN?<br> Artem Sevastopolsky, Yury Malkov, Nikita Durasov, Luisa Verdoliva, Matthias Nießner.<br> ICCV 2023. [PDF] [Code]
3D Reconstruction
StyleGAN Knows Normal, Depth, Albedo, and More.<br> Anand Bhattad, Daniel McKee, Derek Hoiem, D. A. Forsyth.<br> NeurIPS 2023. [PDF]
Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion.<br> Dario Pavllo, David Joseph Tan, Marie-Julie Rakotosaona, Federico Tombari.<br> CVPR 2023. [PDF] [Code]
Monocular 3D Object Reconstruction with GAN Inversion.<br> Junzhe Zhang, Daxuan Ren, Zhongang Cai, Chai Kiat Yeo, Bo Dai, Chen Change Loy.<br> ECCV 2022. [PDF] [Project] [Code]
CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs.<br> Jiteng Mu, Shalini De Mello, Zhiding Yu, Nuno Vasconcelos, Xiaolong Wang, Jan Kautz, Sifei Liu.<br> CVPR 2022. [PDF] [Project]
Normalized Avatar Synthesis Using StyleGAN and Perceptual Refinement.<br> Huiwen Luo, Koki Nagano, Han-Wei Kung, Mclean Goldwhite, Qingguo Xu, Zejian Wang, Lingyu Wei, Liwen Hu, Hao Li.<br> CVPR 2021. [PDF]
Unsupervised 3D Shape Completion through GAN-Inversion.<br> Junzhe Zhang, Xinyi Chen, Zhongang Cai, Liang Pan, Haiyu Zhao, Shuai Yi, Chai Kiat Yeo, Bo Dai, Chen Change Loy.<br> CVPR 2021. [PDF] [Project]
Unsupervised 3D Shape Completion through GAN-Inversion.<br> Junzhe Zhang, Xinyi Chen, Zhongang Cai, Liang Pan, Haiyu Zhao, Shuai Yi, Chai Kiat Yeo, Bo Dai, Chen Change Loy.<br> CVPR 2021. [PDF] [Project]
OSTeC: One-Shot Texture Completion.<br> Baris Gecer, Jiankang Deng, Stefanos Zafeiriou.<br> CVPR 2021. [PDF] [Code]
GAN2Shape: Do 2D GANs Know 3D Shape? Unsupervised 3D shape reconstruction from 2D Image GANs.<br> Xingang Pan, Bo Dai, Ziwei Liu, Chen Change Loy, Ping Luo.<br> ICLR 2021 (oral). [PDF] [Code] [Project]
Other Applications
Compressed Sensing
Generator Surgery for Compressed Sensing.<br> Niklas Smedemark-Margulies, Jung Yeon Park, Max Daniels, Rose Yu, Jan-Willem van de Meent, Paul Hand.<br> NeurIPS 2020 Workshop on Deep Inverse. [PDF] [Code]
Task-Aware Compressed Sensing with Generative Adversarial Networks.<br> Maya Kabkab, Pouya Samangouei, Rama Chellappa.<br> AAAI 2018. [PDF]
Medical Imaging
Controllable Medical Image Generation via Generative Adversarial Networks.<br> Zhihang Ren, Stella X. Yu, David Whitney.<br> Human Vision and Electronic Imaging 2021. [PDF]
High-resolution Controllable Prostatic Histology Synthesis using StyleGAN.<br> Gagandeep B. Daroach, Josiah A. Yoder, Kenneth A. Iczkowski, Peter S. LaViolette.<br> Bioimaging 2021. [PDF]
Compression, Fairness, and Security
FairStyle: Debiasing StyleGAN2 with Style Channel Manipulations.<br> Cemre Karakas, Alara Dirik, Eylul Yalcinkaya, Pinar Yanardag.<br> ECCV 2022. [PDF] [Project] [Code]
Video Coding Using Learned Latent GAN Compression.<br> Mustafa Shukor, Bharath Bhushan Damodaran, Xu Yao, Pierre Hellier.<br> ACM MM 2022. [PDF]
Differentially Private Imaging via Latent Space Manipulation.<br> Tao Li, Chris Clifton.<br> IEEE Symposium on Security & Privacy (S&P) 2021. [PDF]
Acknowledgement
Thanks for the constructive comments from anonymous reviewers and feedback from Jun-Yan Zhu, Andrey Voynov, and Rushil Anirudh.
If you find this repo or our paper is helpful for your research, please consider to cite:
@article{xia2022gan,
author = {Xia, Weihao and Zhang, Yulun and Yang, Yujiu and Xue, Jing-Hao and Zhou, Bolei and Yang, Ming-Hsuan},
title = {GAN Inversion: A Survey},
journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)},
year={2022}
}
<p width="100%" align="right"><a href="#">🔝</a></p>