Towards Vision Transformer Unrolling Fixed-Point Algorithm: a Case Study on Image Restoration
Peng Qiao, Sidun Liu, Tao Sun, Ke Yang, Yong Dou

TL;DR
This paper introduces Transformer-based unrolling frameworks for fixed-point algorithms in image restoration, achieving competitive performance with fewer parameters and training time, and demonstrating the potential of Transformers in low-level vision tasks.
Contribution
It proposes FPformer, FPRformer, and FPAformer frameworks that unroll fixed-point algorithms with Transformers, incorporating parameter sharing and acceleration techniques for improved efficiency and performance.
Findings
FPAformer achieves superior performance with only 29.82% parameters of SwinIR.
The models are trained with only 26.9% of the time required for SwinIR.
Self-supervised pre-training enhances the models' effectiveness in image restoration.
Abstract
The great success of Deep Neural Networks (DNNs) has inspired the algorithmic development of DNN-based Fixed-Point (DNN-FP) for computer vision tasks. DNN-FP methods, trained by Back-Propagation Through Time or computing the inaccurate inversion of the Jacobian, suffer from inferior representation ability. Motivated by the representation power of the Transformer, we propose a framework to unroll the FP and approximate each unrolled process via Transformer blocks, called FPformer. To reduce the high consumption of memory and computation, we come up with FPRformer by sharing parameters between the successive blocks. We further design a module to adapt Anderson acceleration to FPRformer to enlarge the unrolled iterations and improve the performance, called FPAformer. In order to fully exploit the capability of the Transformer, we apply the proposed model to image restoration, using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Sparse and Compressive Sensing Techniques
MethodsAttention Is All You Need · Linear Layer · Softmax · Absolute Position Encodings · Byte Pair Encoding · Adam · Layer Normalization · Label Smoothing · Multi-Head Attention · Dense Connections
