# SingleGAN
PyTorch implementation of our paper: "SingleGAN: Image-to-Image Translation by a Single-Generator Network using Multiple Generative Adversarial Learning".
By leveraging multiple adversarial learning, our model can perform multi-domain and multi-modal image translation with a single generator.
- Base model: *(architecture figure)*
- Extended models: *(architecture figures)*
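To make the idea concrete, below is a minimal sketch (illustrative only, not the code in this repository) of a single generator conditioned on a one-hot domain code and trained against one discriminator per target domain; all class names and layer choices are assumptions:

```python
# Illustrative sketch: one shared generator conditioned on a domain code,
# with one discriminator per domain ("multiple adversarial learning").
import torch
import torch.nn as nn

class CondGenerator(nn.Module):
    """Toy conditional generator: the one-hot domain code is tiled
    spatially and concatenated to the input image channels."""
    def __init__(self, in_ch=3, num_domains=2, base_ch=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch + num_domains, base_ch, 3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(base_ch, in_ch, 3, padding=1),
            nn.Tanh(),
        )

    def forward(self, x, code):
        # code: (N, num_domains) one-hot -> tiled to (N, num_domains, H, W)
        code_map = code[:, :, None, None].expand(-1, -1, x.size(2), x.size(3))
        return self.net(torch.cat([x, code_map], dim=1))

# One (toy) discriminator per domain; each judges realism in its own domain.
discriminators = nn.ModuleList([
    nn.Sequential(
        nn.Conv2d(3, 32, 4, stride=2, padding=1),
        nn.LeakyReLU(0.2, inplace=True),
        nn.Conv2d(32, 1, 4, stride=2, padding=1),
    )
    for _ in range(2)
])
```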
## Dependencies
- Python 3.x
- PyTorch 1.1.0 or later
You can install all the dependencies with:

```bash
pip install -r requirements.txt
```
## Getting Started
### Datasets
- You can either download the default datasets (from pix2pix and CycleGAN) or unzip your own dataset into the `datasets` directory.
- Download a default dataset (e.g. apple2orange):

```bash
bash ./download_datasets.sh apple2orange
```
- Please ensure that your repository has the following directory structure (a minimal loader sketch for this layout follows this list):

```
├── datasets
│   └── apple2orange
│       ├── trainA
│       ├── testA
│       ├── trainB
│       ├── testB
│       ...
```
- The Transient-Attributes dataset can be requested from its project page.
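For custom data in this layout, the loading logic can be as simple as the following sketch (hypothetical: the repository ships its own loader; `UnalignedDataset` and its behavior here only illustrate how `trainA`/`trainB` images are sampled without pairing):

```python
# Hypothetical minimal loader for the datasets/<name>/{trainA,trainB,...} layout.
import random
from pathlib import Path
from PIL import Image
from torch.utils.data import Dataset

class UnalignedDataset(Dataset):
    """Yields unpaired (A, B) images: A in order, B drawn at random."""
    def __init__(self, root, phase='train', transform=None):
        self.a_paths = sorted(Path(root, phase + 'A').glob('*'))
        self.b_paths = sorted(Path(root, phase + 'B').glob('*'))
        self.transform = transform

    def __len__(self):
        return max(len(self.a_paths), len(self.b_paths))

    def __getitem__(self, idx):
        a = Image.open(self.a_paths[idx % len(self.a_paths)]).convert('RGB')
        b = Image.open(random.choice(self.b_paths)).convert('RGB')  # unpaired
        if self.transform is not None:
            a, b = self.transform(a), self.transform(b)
        return {'A': a, 'B': b}
```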
### Training
- Train a base model (e.g. apple2orange):

```bash
bash ./scripts/train_base.sh apple2orange
```
- To view training results and loss plots, run `python -m visdom.server` and open http://localhost:8097 in your browser. More intermediate results can be found in the `checkpoints` directory.
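The loss plots are ordinary visdom line plots; as a standalone illustration (assuming the visdom server above is running, and using a dummy loss value in place of a real one), appending points to a curve looks like this:

```python
# Minimal visdom logging example (dummy data; not this repo's logging code).
import numpy as np
import visdom

vis = visdom.Visdom()  # connects to http://localhost:8097 by default
win = None
for step in range(100):
    loss = float(np.exp(-step / 30.0))  # stand-in for a real training loss
    win = vis.line(X=np.array([step]), Y=np.array([loss]), win=win,
                   update='append' if win else None,
                   opts={'title': 'G loss (dummy)'})
```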
### Testing
- Check the folder name in the `checkpoints` directory (e.g. apple2orange):

```
├── checkpoints
│   └── base_apple2orange
│       └── 2018_10_16_14_49_55
│           └── ...
```
- Run:

```bash
bash ./scripts/test_base.sh apple2orange 2018_10_16_14_49_55
```
- The testing results will be saved in the `checkpoints/base_apple2orange/2018_10_16_14_49_55/results` directory.
In recent experiments, we found that spectral normalization (SN) helps stabilize training, so we have added SN to this implementation. You may need to update PyTorch to 0.4.1 or later to support SN, or use an older version of this code without SN.
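In PyTorch 0.4.1+, SN is attached with `torch.nn.utils.spectral_norm`; the toy discriminator below only illustrates where SN is applied and is not the exact network used in this repository:

```python
# Attaching spectral normalization to discriminator conv layers (PyTorch >= 0.4.1).
import torch.nn as nn
from torch.nn.utils import spectral_norm

disc = nn.Sequential(
    spectral_norm(nn.Conv2d(3, 64, 4, stride=2, padding=1)),
    nn.LeakyReLU(0.2, inplace=True),
    spectral_norm(nn.Conv2d(64, 1, 4, stride=2, padding=1)),
)
```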
## Results
Unsupervised cross-domain translation:
<p align="center"><img src='images/base.jpg' width='100%' /></p>

Unsupervised one-to-many translation:

<p align="center"><img src='images/one2many.jpg' width='90%' /></p>

Unsupervised many-to-many translation:

<p align="center"><img src='images/many2many.jpg' width='60%' /></p>

Unsupervised multimodal translation:
Cat ↔ Dog:
<p align="center">
<img src='images/cat.jpg' width='18%' /><img src='images/cat2dog.gif' width='18%' />
<img src='images/dog.jpg' width='18%' /><img src='images/dog2cat.gif' width='18%' />
</p>

Label ↔ Facade:

<p align="center">
<img src='images/label.jpg' width='18%' /><img src='images/label2facade.gif' width='18%' />
<img src='images/facade.jpg' width='18%' /><img src='images/facade2label.gif' width='18%' />
</p>

Edge ↔ Shoes:

<p align="center">
<img src='images/edge.jpg' width='18%' /><img src='images/edge2shoe.gif' width='18%' />
<img src='images/shoe.jpg' width='18%' /><img src='images/shoe2edge.gif' width='18%' />
</p>

Please note that this repository contains only the unsupervised version of SingleGAN. You can implement the supervised version by overloading the data loader and replacing the cycle-consistency loss with a reconstruction loss; see our paper for more details.
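As a sketch of that change (illustrative only: `G`, `D`, the hinge-style adversarial term, and `lambda_rec` are assumptions, not this repository's API), the supervised generator objective for a paired batch could look like:

```python
# Hypothetical supervised generator loss: with paired (x, y) data, the
# cycle-consistency term is replaced by a direct L1 reconstruction loss.
import torch.nn.functional as F

def supervised_g_loss(G, D, x, y, code, lambda_rec=10.0):
    fake = G(x, code)          # translate x toward y's domain
    adv = -D(fake).mean()      # generator adversarial term (hinge-style)
    rec = F.l1_loss(fake, y)   # reconstruction replaces cycle consistency
    return adv + lambda_rec * rec
```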
## bibtex
If this work is useful for your research, please consider citing:

```
@inproceedings{yu2018singlegan,
  title={SingleGAN: Image-to-Image Translation by a Single-Generator Network using Multiple Generative Adversarial Learning},
  author={Yu, Xiaoming and Cai, Xing and Ying, Zhenqiang and Li, Thomas and Li, Ge},
  booktitle={Asian Conference on Computer Vision},
  year={2018}
}
```
## Acknowledgement
The code used in this research is inspired by BicycleGAN.
## Contact
Feel free to reach out if you have any questions (Xiaoming-Yu@pku.edu.cn).