# Delving Deep into Intrinsic Image Harmonization
Here we provide the PyTorch implementation and pre-trained models of our latest version. If you need the code of our previous CVPR version ("Intrinsic Image Harmonization"), please refer to the release version.
## Prerequisites
- Linux
- Python 3
- CPU or NVIDIA GPU + CUDA CuDNN
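Before training, it can help to confirm that PyTorch actually sees the GPU. A minimal check, assuming PyTorch is already installed:

```python
import torch

# Print the installed PyTorch version and the visible CUDA device, if any.
print("PyTorch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("Device:", torch.cuda.get_device_name(0))
```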
## Datasets
- Download the iHarmony4 dataset, and our HVIDIT dataset from Google Drive or BaiduCloud (access code: akbi); a layout sanity-check sketch follows below.
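Once the archives are unpacked under a common `<dataset_dir>`, a rough sanity check like the sketch below can catch incomplete downloads. It assumes the public iHarmony4 layout (`composite_images/` and `masks/` per sub-dataset, composites named `<real>_<mask>_<id>.jpg` with masks `<real>_<mask>.png`); this is an assumption about the on-disk layout, not a description of this repo's data loader.

```python
import os

dataset_root = "/path/to/datasets"  # hypothetical <dataset_dir>

# Assumed iHarmony4 layout; HVIDIT presumably follows the same scheme.
for sub in ["HCOCO", "HAdobe5k", "HFlickr", "Hday2night"]:
    comp_dir = os.path.join(dataset_root, sub, "composite_images")
    mask_dir = os.path.join(dataset_root, sub, "masks")
    if not os.path.isdir(comp_dir):
        print(f"{sub}: missing {comp_dir}")
        continue
    missing = 0
    for name in os.listdir(comp_dir):
        stem = os.path.splitext(name)[0]                    # e.g. "f800_1_2"
        mask_name = "_".join(stem.split("_")[:2]) + ".png"  # -> "f800_1.png"
        if not os.path.exists(os.path.join(mask_dir, mask_name)):
            missing += 1
    print(f"{sub}: {missing} composites without a matching mask")
```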
## Base Model
- Train

```bash
CUDA_VISIBLE_DEVICES=0 python train.py --model iih_base --name iih_base_allihd_test --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
- Test

```bash
CUDA_VISIBLE_DEVICES=0 python test.py --model iih_base --name iih_base_allihd_test --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
- Apply pre-trained model

Download the pre-trained model from Google Drive or BaiduCloud (access code: n4js), put `latest_net_G.pth` in the directory `checkpoints/iih_base_allihd`, and run the command below; a quick way to sanity-check the downloaded weights is sketched after it.

```bash
CUDA_VISIBLE_DEVICES=0 python test.py --model iih_base --name iih_base_allihd --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
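To verify the downloaded weights before running inference, loading the state dict on CPU is enough; no model needs to be built. The path follows the `checkpoints/<name>/` convention used by the commands above, and the assumption that the file stores a plain state dict follows common PyTorch practice rather than anything verified against this repo:

```python
import torch

# Inspect the generator checkpoint without instantiating the network.
state = torch.load("checkpoints/iih_base_allihd/latest_net_G.pth",
                   map_location="cpu")
print(len(state), "tensors in checkpoint")
for key, tensor in list(state.items())[:5]:
    print(key, tuple(tensor.shape))  # first few layer names and shapes
```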
## Base Model with Lighting
- Train

```bash
CUDA_VISIBLE_DEVICES=0 python train.py --model iih_base_lt --name iih_base_lt_allihd_test --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
- Test

```bash
CUDA_VISIBLE_DEVICES=0 python test.py --model iih_base_lt --name iih_base_lt_allihd_test --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
- Apply pre-trained model

Download the pre-trained model from Google Drive or BaiduCloud (access code: hqhw), put `latest_net_G.pth` in the directory `checkpoints/iih_base_lt_allihd`, and run:

```bash
CUDA_VISIBLE_DEVICES=0 python test.py --model iih_base_lt --name iih_base_lt_allihd --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
## Base Model with Guiding
- Train

```bash
CUDA_VISIBLE_DEVICES=0 python train.py --model iih_base_gd --name iih_base_gd_allihd_test --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
- Test

```bash
CUDA_VISIBLE_DEVICES=0 python test.py --model iih_base_gd --name iih_base_gd_allihd_test --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
- Apply pre-trained model

Download the pre-trained model from Google Drive or BaiduCloud (access code: nqrc), put `latest_net_G.pth` in the directory `checkpoints/iih_base_gd_allihd`, and run:

```bash
CUDA_VISIBLE_DEVICES=0 python test.py --model iih_base_gd --name iih_base_gd_allihd --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
## Base Model with Lighting and Guiding
- Train

```bash
CUDA_VISIBLE_DEVICES=0 python train.py --model iih_base_lt_gd --name iih_base_lt_gd_allihd_test --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
- Test

```bash
CUDA_VISIBLE_DEVICES=0 python test.py --model iih_base_lt_gd --name iih_base_lt_gd_allihd_test --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
- Apply pre-trained model

Download the pre-trained model from Google Drive or BaiduCloud (access code: kmgp), put `latest_net_G.pth` in the directory `checkpoints/iih_base_lt_gd_allihd`, and run:

```bash
CUDA_VISIBLE_DEVICES=0 python test.py --model iih_base_lt_gd --name iih_base_lt_gd_allihd --dataset_root <dataset_dir> --dataset_name IHD --batch_size xx --init_port xxxx
```
## Base Model with Lighting and Guiding on iHarmony4 and HVIDIT Datasets
- Train

```bash
CUDA_VISIBLE_DEVICES=0 python train.py --model iih_base_lt_gd --name iih_base_lt_gd_newihd_test --dataset_root <dataset_dir> --dataset_name newIHD --batch_size xx --init_port xxxx
```
- Test

```bash
CUDA_VISIBLE_DEVICES=0 python test.py --model iih_base_lt_gd --name iih_base_lt_gd_newihd_test --dataset_root <dataset_dir> --dataset_name newIHD --batch_size xx --init_port xxxx
```
- Apply pre-trained model

Download the pre-trained model from Google Drive or BaiduCloud (access code: jnhg), put `latest_net_G.pth` in the directory `checkpoints/iih_base_lt_gd_newihd` (matching the `--name` used below), and run:

```bash
CUDA_VISIBLE_DEVICES=0 python test.py --model iih_base_lt_gd --name iih_base_lt_gd_newihd --dataset_root <dataset_dir> --dataset_name newIHD --batch_size xx --init_port xxxx
```
## Evaluation
We provide the evaluation code in `evaluation/ih_evaluation.py`. Run:
```bash
# iHarmony4 dataset
CUDA_VISIBLE_DEVICES=0 python evaluation/ih_evaluation.py --dataroot <dataset_dir> --result_root results/experiment/test_latest/images/ --evaluation_type our --dataset_name ALL

# iHarmony4 and HVIDIT datasets
CUDA_VISIBLE_DEVICES=0 python evaluation/ih_evaluation.py --dataroot <dataset_dir> --result_root results/experiment/test_latest/images/ --evaluation_type our --dataset_name newALL
```
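The f-prefixed metrics reported below (fMSE, fPSNR, fSSIM) are restricted to the foreground region given by the composite mask. As a minimal sketch of the idea, not the exact code in `evaluation/ih_evaluation.py`:

```python
import numpy as np

def foreground_mse_psnr(harmonized, target, mask, max_val=255.0):
    # harmonized, target: HxWx3 float arrays in [0, max_val]; mask: HxW in [0, 1].
    # fMSE averages the squared error over foreground pixels only;
    # fPSNR is the usual PSNR formula applied to that foreground MSE.
    fg = mask > 0.5
    diff_sq = ((harmonized - target) ** 2)[fg]
    fmse = float(diff_sq.mean())
    fpsnr = 10.0 * np.log10(max_val ** 2 / max(fmse, 1e-10))
    return fmse, fpsnr

# Toy usage with random data:
h = np.random.rand(64, 64, 3) * 255
t = np.random.rand(64, 64, 3) * 255
m = np.zeros((64, 64)); m[16:48, 16:48] = 1.0
print(foreground_mse_psnr(h, t, m))
```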
## Quantitative Results
| Dataset | Metric | Composite | Ours (iHarmony4) | Ours (iHarmony4+HVIDIT) |
| --- | --- | ---: | ---: | ---: |
| HCOCO | MSE | 69.37 | 21.61 | 21.51 |
| | PSNR | 33.99 | 37.82 | 37.81 |
| | SSIM | 0.9853 | 0.9812 | 0.9812 |
| | fMSE | 996.59 | 361.94 | 363.76 |
| | fPSNR | 19.86 | 24.17 | 24.17 |
| | fSSIM | 0.8257 | 0.8736 | 0.8735 |
| HAdobe5k | MSE | 345.54 | 40.67 | 39.27 |
| | PSNR | 28.52 | 36.61 | 36.60 |
| | SSIM | 0.9483 | 0.9362 | 0.9364 |
| | fMSE | 2051.61 | 259.05 | 259.91 |
| | fPSNR | 17.52 | 26.36 | 26.32 |
| | fSSIM | 0.7295 | 0.8413 | 0.8407 |
| HFlickr | MSE | 264.35 | 94.91 | 94.25 |
| | PSNR | 28.43 | 32.10 | 32.06 |
| | SSIM | 0.9620 | 0.9614 | 0.9615 |
| | fMSE | 1574.37 | 638.36 | 635.73 |
| | fPSNR | 18.09 | 21.97 | 21.92 |
| | fSSIM | 0.8036 | 0.8444 | 0.8436 |
| Hday2night | MSE | 109.65 | 51.44 | 59.87 |
| | PSNR | 34.36 | 37.06 | 36.42 |
| | SSIM | 0.9607 | 0.9308 | 0.9318 |
| | fMSE | 1409.98 | 740.59 | 856.95 |
| | fPSNR | 19.14 | 22.40 | 21.73 |
| | fSSIM | 0.6353 | 0.6585 | 0.6549 |
| HVIDIT | MSE | 53.12 | - | 25.51 |
| | PSNR | 38.72 | - | 41.43 |
| | SSIM | 0.9922 | - | 0.9919 |
| | fMSE | 1604.41 | - | 738.66 |
| | fPSNR | 19.01 | - | 21.86 |
| | fSSIM | 0.7614 | - | 0.7139 |
| ALL | MSE | 167.39 | 35.90 | 35.09 |
| | PSNR | 32.07 | 36.81 | 36.99 |
| | SSIM | 0.9724 | 0.9649 | 0.9662 |
| | fMSE | 1386.12 | 369.64 | 388.30 |
| | fPSNR | 18.97 | 24.53 | 24.39 |
| | fSSIM | 0.7905 | 0.8571 | 0.8506 |

## Real composite image harmonization
More comparison results can be found on Google Drive or BaiduCloud (access code: lgs2).
## Bibtex
If you use this code for your research, please cite our paper:
```bibtex
@InProceedings{Guo_2021_CVPR,
    author    = {Guo, Zonghui and Zheng, Haiyong and Jiang, Yufeng and Gu, Zhaorui and Zheng, Bing},
    title     = {Intrinsic Image Harmonization},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {16367-16376}
}
```
## Acknowledgement
Some of the data modules and model functions in this code are adapted from the repositories of DoveNet and CycleGAN, which we gratefully acknowledge.