Home

Awesome

<div align="center">

Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach

Gang Wu, Junjun Jiang, Junpeng Jiang, and Xianming Liu

AIIA Lab, Harbin Institute of Technology.


paper | results | pretrained models

Hits

</div>

This repository is the official PyTorch implementation of "Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach"

Recent progress in single-image super-resolution (SISR) has achieved remarkable performance, yet the computational costs of these methods remain a challenge for deployment on resource-constrained devices. Especially for transformer-based methods, the self-attention mechanism in such models brings great breakthroughs while incurring substantial computational costs. To tackle this issue, we introduce the Convolutional Transformer layer (ConvFormer) and the ConvFormer-based Super-Resolution network (CFSR), which offer an effective and efficient solution for lightweight image super-resolution tasks. In detail, CFSR leverages the large kernel convolution as the feature mixer to replace the self-attention module, efficiently modeling long-range dependencies and extensive receptive fields with a slight computational cost. Furthermore, we propose an edge-preserving feed-forward network, simplified as EFN, to obtain local feature aggregation and simultaneously preserve more high-frequency information. Extensive experiments demonstrate that CFSR can achieve an advanced trade-off between computational cost and performance when compared to existing lightweight SR methods. Compared to state-of-the-art methods, e.g. ShuffleMixer, the proposed CFSR achieves \textit{0.39 dB} gains on Urban100 dataset for $\times2$ SR task while containing\textit{ 26% }and \textit{31%} fewer parameters and FLOPs, respectively.

Results

Results of x2, x3, and x4 SR tasks are available at Google Drive

MethodScaleParamsFLOPsSet5 (PSNR/SSIM)Set14 (PSNR/SSIM)B100 (PSNR/SSIM)Urban100 (PSNR/SSIM)Manga109 (PSNR/SSIM)
VDSR$666 \mathrm{~K}$$612.6 \mathrm{G}$$31.35 / 0.8838$$28.01 / 0.7674$$27.29 / 0.7251$$25.18 / 0.7524$$28.83 / 0.8870$
LapSRN$813 \mathrm{~K}$$149.4 \mathrm{G}$$31.54 / 0.8852$$28.09 / 0.7700$$27.32 / 0.7275$$25.21 / 0.7562$$29.09 / 0.8900$
IDN$553 \mathrm{~K}$$32.3 \mathrm{G}$$31.82 / 0.8903$$28.25 / 0.7730$$27.41 / 0.7297$$25.41 / 0.7632$$29.41 / 0.8942$
CARN$592 \mathrm{~K}$$90.9 \mathrm{G}$$32.13 / 0.8937$$28.60 / 0.7806$$27.58 / 0.7349$$26.07 / 0.7837$$30.47 / 0.9084$
SRResNet$1,518 \mathrm{~K}$$146 \mathrm{G}$$32.17 / 0.8951$$28.61 / 0.7823$$27.59 / 0.7365$$26.12 / 0.7871$$30.48 / 0.9087$
IMDN$715 \mathrm{~K}$$40.9 \mathrm{G}$$32.21 / 0.8948$$28.58 / 0.7811$$27.56 / 0.7353$$26.04 / 0.7838$$30.45 / 0.9075$
LatticeNet$777 \mathrm{~K}$$43.6 \mathrm{G}$$32.18 / 0.8943$$28.61 / 0.7812$$27.57 / 0.7355$$26.14 / 0.7844$$-/-$
LAPAR-A$\times 4$$659 \mathrm{~K}$$94.0 \mathrm{G}$$32.15 / 0.8944$$28.61 / 0.7818$$27.61 / 0.7366$$26.14 / 0.7871$$30.42 / 0.9074$
SMSR$1006 \mathrm{~K}$$41.6 \mathrm{G}$$32.12 / 0.8932$$28.55 / 0.7808$$27.55 / 0.7351$$26.11 / 0.7868$$30.54 / 0.9085$
ECBSR$603 \mathrm{~K}$$34.7 \mathrm{G}$$31.92 / 0.8946$$28.34 / 0.7817$$27.48 / 0.7393$$25.81 / 0.7773$$-/-$
PAN$272 \mathrm{~K}$$28.2 \mathrm{G}$$32.13 / 0.8948$$28.61 / 0.7822$$27.59 / 0.7363$$26.11 / 0.7854$$30.51 / 0.9095$
DRSAN$410 \mathrm{~K}$$30.5 \mathrm{G}$$32.15 / 0.8935$$28.54 / 0.7813$$27.54 / 0.7364$$26.06 / 0.7858$$-/-$
DDistill-SR$434 \mathrm{~K}$$33.0 \mathrm{G}$$32.23 / 0.8960$$28.62 / 0.7823$$27.58 / 0.7365$$26.20 / 0.7891$$30.48 / 0.9090$
RFDN$550 \mathrm{~K}$$23.9 \mathrm{G}$$32.24 / 0.8952$$28.61 / 0.7819$$27.57 / 0.7360$$26.11 / 0.7858$$30.58 / 0.9089$
ShuffleMixer$411 K$$28.0 \mathrm{G}$$32.21 / 0.8953$$28.66 / 0.7827$$27.61 / 0.7366$$26.08 / 0.7835$$30.65 / 0.9093$
CFSR (Ours)$307 \mathrm{~K}$$17.5 \mathrm{G}$$32.33/0.8964$$28.73 / 0.7842$$27.63 / 0.7381$$26.21/0.7897$$30.72 / 0.9111$

TODO