Towards Vision Transformer Unrolling Fixed-Point Algorithm: a Case Study   on Image Restoration

Peng Qiao; Sidun Liu; Tao Sun; Ke Yang; Yong Dou

arXiv:2301.12332·cs.CV·January 31, 2023

Towards Vision Transformer Unrolling Fixed-Point Algorithm: a Case Study on Image Restoration

Peng Qiao, Sidun Liu, Tao Sun, Ke Yang, Yong Dou

PDF

Open Access

TL;DR

This paper introduces Transformer-based unrolling frameworks for fixed-point algorithms in image restoration, achieving competitive performance with fewer parameters and training time, and demonstrating the potential of Transformers in low-level vision tasks.

Contribution

It proposes FPformer, FPRformer, and FPAformer frameworks that unroll fixed-point algorithms with Transformers, incorporating parameter sharing and acceleration techniques for improved efficiency and performance.

Findings

01

FPAformer achieves superior performance with only 29.82% parameters of SwinIR.

02

The models are trained with only 26.9% of the time required for SwinIR.

03

Self-supervised pre-training enhances the models' effectiveness in image restoration.

Abstract

The great success of Deep Neural Networks (DNNs) has inspired the algorithmic development of DNN-based Fixed-Point (DNN-FP) for computer vision tasks. DNN-FP methods, trained by Back-Propagation Through Time or computing the inaccurate inversion of the Jacobian, suffer from inferior representation ability. Motivated by the representation power of the Transformer, we propose a framework to unroll the FP and approximate each unrolled process via Transformer blocks, called FPformer. To reduce the high consumption of memory and computation, we come up with FPRformer by sharing parameters between the successive blocks. We further design a module to adapt Anderson acceleration to FPRformer to enlarge the unrolled iterations and improve the performance, called FPAformer. In order to fully exploit the capability of the Transformer, we apply the proposed model to image restoration, using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Sparse and Compressive Sensing Techniques

MethodsAttention Is All You Need · Linear Layer · Softmax · Absolute Position Encodings · Byte Pair Encoding · Adam · Layer Normalization · Label Smoothing · Multi-Head Attention · Dense Connections