On Efficient Transformer-Based Image Pre-training for Low-Level Vision

Wenbo Li; Xin Lu; Shengju Qian; Jiangbo Lu; Xiangyu Zhang; Jiaya Jia

arXiv:2112.10175·cs.CV·March 22, 2022·26 cites

On Efficient Transformer-Based Image Pre-training for Low-Level Vision

Wenbo Li, Xin Lu, Shengju Qian, Jiangbo Lu, Xiangyu Zhang, Jiaya Jia

PDF

Open Access 1 Repo

TL;DR

This paper investigates how transformer-based pre-training influences low-level vision tasks, revealing task-specific effects and proposing effective multi-task pre-training strategies to achieve state-of-the-art results.

Contribution

It provides a comprehensive analysis of pre-training effects on low-level vision tasks and introduces effective multi-task pre-training methods for transformers.

Findings

01

Pre-training introduces more local information in super-resolution.

02

Pre-training has limited impact on denoising internal features.

03

Multi-related-task pre-training outperforms other methods.

Abstract

Pre-training has marked numerous state of the arts in high-level computer vision, while few attempts have ever been made to investigate how pre-training acts in image processing systems. In this paper, we tailor transformer-based pre-training regimes that boost various low-level tasks. To comprehensively diagnose the influence of pre-training, we design a whole set of principled evaluation tools that uncover its effects on internal representations. The observations demonstrate that pre-training plays strikingly different roles in low-level tasks. For example, pre-training introduces more local information to higher layers in super-resolution (SR), yielding significant performance gains, while pre-training hardly affects internal feature representations in denoising, resulting in limited gains. Further, we explore different methods of pre-training, revealing that multi-related-task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fenglinglwb/edt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Image Processing Techniques and Applications