Home

Awesome

Permutation invariance and uncertainty in multitemporal image super-resolution

PIUnet is a novel Multi Image Super-Resolution (MISR) deep neural network that exploits both spatial and temporal correlations to recover a single high resolution image from multiple unregistered low resolution images. It is fully invariant to the ordering of the input images and produces an estimate of the aleatoric uncertainty together with the super-resolved image.

This repository contains the pytorch implementation of PIUnet, trained and tested on the PROBA-V dataset provided by ESA’s Advanced Concepts Team in the context of the European Space Agency's Kelvin competition.

BibTex reference:

@article{valsesia2021piunet,
  title={Permutation invariance and uncertainty in multitemporal image super-resolution},
  author={Valsesia, Diego and Magli, Enrico},
  journal={IEEE Transactions on Geoscience and Remote Sensing},
  year={2022},
  volume={60},
  number={},
  pages={1-12},
  doi={10.1109/TGRS.2021.3130673}
}

Setup to get started

Make sure you have Python3 and all the required python packages installed:

pip install -r requirements.txt

The code has been tested on Pytorch 1.7.1 for CUDA 10.1. Newer version will likely work as well.

Load data from Kelvin Competition and create the training set and the validation set

Usage

"N_feat" : no. of feature channels (42)
"R_bneck" : no. of features in attention bottlenecks (8)
"N_tefa": no. of TEFA blocks (16),
"N_heads": no. of heads for multihead attention (1),
"patch_size": training patch size (32),
"batch_size": training batch size (24),
"N_epoch": no. of training epochs (200),
"learning_rate": optimizer learning rate (1e-4),
"train_lr_file": path to npy file with train LR images,
"train_hr_file": path to npy file with train HR images,
"train_masks_file": path to npy file with train image masks,
"val_lr_file": path to npy file with validation LR images,
"val_hr_file": path to npy file with validation HR images,
"val_masks_file": path to npy file with validation image masks,
"max_train_scenes": maximum no. of images to be used for training (full dataset: 563 for NIR, 591 for RED)
./launcher_train.sh

OR (after making sure the supplied log_dir and save_dir exist)

CUDA_VISIBLE_DEVICES=0 python Code/piunet/main.py --log_dir log_dir/piunet/ --save_dir Results/piunet/

Default parameters require approximately 18GB of GPU memory for training.

Pretrained models

Testing

Authors & Contacts

PIUnet is based on work by the Image Processing and Learning group of Politecnico di Torino. Contacts: Diego Valsesia (diego.valsesia AT polito.it), Enrico Magli (enrico.magli AT polito.it).