LKFMixer: Exploring Large Kernel Feature For Efficient Image Super-Resolution

Yinggan Tang; Quanwei Hu

arXiv:2508.11391·eess.IV·August 18, 2025

LKFMixer: Exploring Large Kernel Feature For Efficient Image Super-Resolution

Yinggan Tang, Quanwei Hu

PDF

TL;DR

LKFMixer is a convolutional neural network that uses large kernels and novel modules to efficiently capture non-local features for superior image super-resolution, outperforming state-of-the-art methods in quality and speed.

Contribution

The paper introduces LKFMixer, a CNN with large kernels and new modules that simulate self-attention for efficient super-resolution without heavy computation.

Findings

01

Outperforms SOTA methods in SR quality and reconstruction.

02

Achieves 0.6dB PSNR improvement over SwinIR-light on Manga109.

03

Runs 5 times faster than comparable models.

Abstract

The success of self-attention (SA) in Transformer demonstrates the importance of non-local information to image super-resolution (SR), but the huge computing power required makes it difficult to implement lightweight models. To solve this problem, we propose a pure convolutional neural network (CNN) model, LKFMixer, which utilizes large convolutional kernel to simulate the ability of self-attention to capture non-local features. Specifically, we increase the kernel size to 31 to obtain the larger receptive field as possible, and reduce the parameters and computations by coordinate decomposition. Meanwhile, a spatial feature modulation block (SFMB) is designed to enhance the focus of feature information on both spatial and channel dimension. In addition, by introducing feature selection block (FSB), the model can adaptively adjust the weights between local features and non-local…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.