Home

Awesome

LaKDNet: Revisiting Image Deblurring with an Efficient ConvNet

Lingyan Ruan, Mojtaba Bemana, Hans-peter Seidel, Karol Myszkowski, Bin Chen

Max-Planck-Institut fur Informatik

PWC

PWC

PWC

PWC

PWC

Abstract: The recent advancements in Transformers for computer vision tasks have had a notable impact on the field of image restoration. This has led to the development of generic structures, such as Uformer, and Restormer, which have shown superior performance over dedicated task-specific CNNs. The success of these structures can be attributed to their ability to handle long-range interactions, which is believed to be lacking in CNNs. The aim of this paper is to address the limitations of CNN-based structures and enable them to perform image restoration tasks such as motion and defocus deblurring with comparable effectiveness to Transformers. To investigate the factors contributing to restoration performance differences, we analyze the effective receptive field (ERF) of 10 existing methods and propose a metric called ERFMeter to compare ERF across different architectures. Our analysis reveals that the global and local properties of ERF are crucial for achieving superior performance. Based on these findings, we propose a CNN structure called LaKDNet, incorporating a large kernel convolution and mixer shortcuts scheme to enhance the global and local properties of ERF in CNNs. This approach demonstrates higher efficiency than generic Transformer works, as well as existing CNNs with multi-scale-stage strategies. Specifically, we achieve a PSNR improvement of +0.80 dB / +0.67 dB over the state-of-the-art Restormer / Uformer on the GOPRO dataset. Our findings suggest that there are still rooms to improve the performance of CNN when refining the network structure towards an optimal ERF.

<hr />

paper [Will update the results with the large model released here.]

@misc{ruan2023revisiting,
      title={Revisiting Image Deblurring with an Efficient ConvNet}, 
      author={Lingyan Ruan and Mojtaba Bemana and Hans-peter Seidel and Karol Myszkowski and Bin Chen},
      year={2023},
      eprint={2302.02234},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Effective Receptive Field Analysis

Motion & Defocus

<img src = "./assets/ERF_demo.png" width='60%' height ='60%'>

Visual Performance

<img src = "./assets/visual_performance.png" width='80%' height ='80%'>

Performance vs. Computational Cost

<img src = "./assets/assets_params.png" width='80%' height ='80%'>

Visual Result and Pre-trained Models

Please download the weights and visual result in this webpage

TaskTrained DatasetEvaluated DatasetPSNRSSIM#params
MotionGoProGoPro33.350.96417.1M
MotionGoProGoPro33.720.96737.5M
MotionGoProHIDE31.210.94317.1M
MotionGoProHIDE31.580.94637.5M
MotionGoProRealBlur-R35.900.95417.1M
MotionGoProRealBlur-R36.080.95537.5M
MotionGoProRealBlur-J28.780.87817.1M
MotionGoProRealBlur-J28.590.87037.5M
MotionRealBlur-RRealBlur-R39.910.97417.1M
MotionRealBlur-RRealBlur-R40.250.97537.5M
MotionRealBlur-JRealBlur-J32.330.92917.1M
MotionRealBlur-JRealBlur-J32.680.93237.5M
Defocus-singleDPDDDPDD26.150.81017.7M
Defocus-singleDPDDDPDD26.250.81338.4M
Defocus-singleDPDDRealDOF25.080.76217.7M
Defocus-singleDPDDRealDOF25.130.76538.4M
Defocus-singleLFDOFLFDOF31.870.91217.7M
Defocus-singleLFDOF & DPDDDPDD25.890.79217.7M
Defocus-singleLFDOF & DPDDRealDOF25.830.76917.7M
Defocus-dualDPDDDPDD26.720.82617.7M
Defocus-dualDPDDDPDD26.890.83238.4M

note: we increase the channel dimension from 32 to 48 to get the large version.

Instruction on Evaluation

note: Training will come later.

Prerequisites

Ubuntu Python CUDA PyTorch

Notes: the code may also work with other library versions that didn't specify here.

1. Installation

Clone this project to your local machine

$ git clone https://github.com/lingyanruan/LaKDNet.git
$ cd LaKDNet

2. Environment setup

$ conda create -y --name LaKDNet python=3.8.13 && conda activate LaKDNet
$ sh install_CUDA11.1.1.sh
# Other version will be checked and updated later.

3. Pre-trained models

Download and unzip under ./ckpts/ from webpage:

4. Datasets download

Download and unzip under ./Test_sets/ from webpage:

The original full defocus datasets could be found here: (LFDOF, DPDD, CUHK and RealDOF):

The original full motion datasets could be found here: (GOPRO, HIDE, REALR&REALJ).

5. Command Line

# type could be Motion or Defocus 
# for motion evaluation
$ python run.py --type Motion

# for defocus evaluation
$ python run.py --type Defocus

Contact

Should you have any questions, please open an issue or contact me lyruanruan@gmail.com

Acknowledgment: Thanks for all the fabulous works included in our comparsions, e.g. Restormer, Uformer, NAFNet.

License

This software is being made available under the terms in the LICENSE file.