Home

Awesome

Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution

Jianze Li, Jiezhang Cao, Zichen Zou, Xiongfei Su, Xin Yuan, Yulun Zhang, Yong Guo, and Xiaokang Yang, "Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution", arXiv, 2024

[arXiv] [supplementary material] [visual results] [models]

Abstract: Diffusion models have been achieving excellent performance for real-world image super-resolution (Real-ISR) with considerable computational costs. Current approaches are trying to derive one-step diffusion models from multi-step counterparts through knowledge distillation. However, these methods incur substantial training costs and may constrain the performance of the student model by the teacher's limitations. To tackle these issues, we propose DFOSD, a Distillation-Free One-Step Diffusion model. Specifically, we propose a noise-aware discriminator (NAD) to participate in adversarial training, further enhancing the authenticity of the generated content. Additionally, we improve the perceptual loss with edge-aware DISTS (EA-DISTS) to enhance the model's ability to generate fine details. Our experiments demonstrate that, compared with previous diffusion-based methods requiring dozens or even hundreds of steps, our DFOSD attains comparable or even superior results in both quantitative metrics and qualitative evaluations. Our DFOSD also abtains higher performance and efficiency compared with other one-step diffusion methods.

Training framework of DFOSD.

<p align="center"> <img src="figs/compare.png"> </p> Visual comparisons (x4) of different DM-based Real-ISR methods, including their inference times and MACs (Multiply-Accumulate Operations), for an output size of 512x512. The inference times are measured on an A100 GPU.

⚒️ TODO

🔗 Contents

🔎 Results

<details> <summary>DFOSD achieves superior performance on Real-world datasets. (click to expand)</summary> <p align="center"> <img width="90%" src="figs/NR_DM_compare.png"> </p> </details> <details> <summary>DFOSD achieves the fastest inference speed. (click to expand)</summary> <p align="center"> <img width="90%" src="figs/complexity.png"> </p> </details> <details> <summary>Visual Results. (click to expand)</summary> <p align="center"> <img width="100%" src="figs/vsiual_results.png"> </p> </details>

Citation

If you find the code helpful in your research or work, please cite the following paper(s).

@article{li2024distillation,
  title={Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution},
  author={Li, Jianze and Cao, Jiezhang and Zou, Zichen and Su, Xiongfei and Yuan, Xin and Zhang, Yulun and Guo, Yong and Yang, Xiaokang},
  journal={arXiv preprint arXiv:2410.04224},
  year={2024}
}

💡 Acknowledgements

[TBD]