Awesome
<div align="center">Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach
Gang Wu, Junjun Jiang, Junpeng Jiang, and Xianming Liu
AIIA Lab, Harbin Institute of Technology.
paper | results | pretrained models
</div>This repository is the official PyTorch implementation of "Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach"
Recent progress in single-image super-resolution (SISR) has achieved remarkable performance, yet the computational costs of these methods remain a challenge for deployment on resource-constrained devices. Especially for transformer-based methods, the self-attention mechanism in such models brings great breakthroughs while incurring substantial computational costs. To tackle this issue, we introduce the Convolutional Transformer layer (ConvFormer) and the ConvFormer-based Super-Resolution network (CFSR), which offer an effective and efficient solution for lightweight image super-resolution tasks. In detail, CFSR leverages the large kernel convolution as the feature mixer to replace the self-attention module, efficiently modeling long-range dependencies and extensive receptive fields with a slight computational cost. Furthermore, we propose an edge-preserving feed-forward network, simplified as EFN, to obtain local feature aggregation and simultaneously preserve more high-frequency information. Extensive experiments demonstrate that CFSR can achieve an advanced trade-off between computational cost and performance when compared to existing lightweight SR methods. Compared to state-of-the-art methods, e.g. ShuffleMixer, the proposed CFSR achieves \textit{0.39 dB} gains on Urban100 dataset for $\times2$ SR task while containing\textit{ 26% }and \textit{31%} fewer parameters and FLOPs, respectively.
Results
Results of x2, x3, and x4 SR tasks are available at Google Drive
Method | Scale | Params | FLOPs | Set5 (PSNR/SSIM) | Set14 (PSNR/SSIM) | B100 (PSNR/SSIM) | Urban100 (PSNR/SSIM) | Manga109 (PSNR/SSIM) |
---|---|---|---|---|---|---|---|---|
VDSR | $666 \mathrm{~K}$ | $612.6 \mathrm{G}$ | $31.35 / 0.8838$ | $28.01 / 0.7674$ | $27.29 / 0.7251$ | $25.18 / 0.7524$ | $28.83 / 0.8870$ | |
LapSRN | $813 \mathrm{~K}$ | $149.4 \mathrm{G}$ | $31.54 / 0.8852$ | $28.09 / 0.7700$ | $27.32 / 0.7275$ | $25.21 / 0.7562$ | $29.09 / 0.8900$ | |
IDN | $553 \mathrm{~K}$ | $32.3 \mathrm{G}$ | $31.82 / 0.8903$ | $28.25 / 0.7730$ | $27.41 / 0.7297$ | $25.41 / 0.7632$ | $29.41 / 0.8942$ | |
CARN | $592 \mathrm{~K}$ | $90.9 \mathrm{G}$ | $32.13 / 0.8937$ | $28.60 / 0.7806$ | $27.58 / 0.7349$ | $26.07 / 0.7837$ | $30.47 / 0.9084$ | |
SRResNet | $1,518 \mathrm{~K}$ | $146 \mathrm{G}$ | $32.17 / 0.8951$ | $28.61 / 0.7823$ | $27.59 / 0.7365$ | $26.12 / 0.7871$ | $30.48 / 0.9087$ | |
IMDN | $715 \mathrm{~K}$ | $40.9 \mathrm{G}$ | $32.21 / 0.8948$ | $28.58 / 0.7811$ | $27.56 / 0.7353$ | $26.04 / 0.7838$ | $30.45 / 0.9075$ | |
LatticeNet | $777 \mathrm{~K}$ | $43.6 \mathrm{G}$ | $32.18 / 0.8943$ | $28.61 / 0.7812$ | $27.57 / 0.7355$ | $26.14 / 0.7844$ | $-/-$ | |
LAPAR-A | $\times 4$ | $659 \mathrm{~K}$ | $94.0 \mathrm{G}$ | $32.15 / 0.8944$ | $28.61 / 0.7818$ | $27.61 / 0.7366$ | $26.14 / 0.7871$ | $30.42 / 0.9074$ |
SMSR | $1006 \mathrm{~K}$ | $41.6 \mathrm{G}$ | $32.12 / 0.8932$ | $28.55 / 0.7808$ | $27.55 / 0.7351$ | $26.11 / 0.7868$ | $30.54 / 0.9085$ | |
ECBSR | $603 \mathrm{~K}$ | $34.7 \mathrm{G}$ | $31.92 / 0.8946$ | $28.34 / 0.7817$ | $27.48 / 0.7393$ | $25.81 / 0.7773$ | $-/-$ | |
PAN | $272 \mathrm{~K}$ | $28.2 \mathrm{G}$ | $32.13 / 0.8948$ | $28.61 / 0.7822$ | $27.59 / 0.7363$ | $26.11 / 0.7854$ | $30.51 / 0.9095$ | |
DRSAN | $410 \mathrm{~K}$ | $30.5 \mathrm{G}$ | $32.15 / 0.8935$ | $28.54 / 0.7813$ | $27.54 / 0.7364$ | $26.06 / 0.7858$ | $-/-$ | |
DDistill-SR | $434 \mathrm{~K}$ | $33.0 \mathrm{G}$ | $32.23 / 0.8960$ | $28.62 / 0.7823$ | $27.58 / 0.7365$ | $26.20 / 0.7891$ | $30.48 / 0.9090$ | |
RFDN | $550 \mathrm{~K}$ | $23.9 \mathrm{G}$ | $32.24 / 0.8952$ | $28.61 / 0.7819$ | $27.57 / 0.7360$ | $26.11 / 0.7858$ | $30.58 / 0.9089$ | |
ShuffleMixer | $411 K$ | $28.0 \mathrm{G}$ | $32.21 / 0.8953$ | $28.66 / 0.7827$ | $27.61 / 0.7366$ | $26.08 / 0.7835$ | $30.65 / 0.9093$ | |
CFSR (Ours) | $307 \mathrm{~K}$ | $17.5 \mathrm{G}$ | $32.33/0.8964$ | $28.73 / 0.7842$ | $27.63 / 0.7381$ | $26.21/0.7897$ | $30.72 / 0.9111$ |
TODO
- Add implementation code
- Add pretrained model