Enhancing Image Restoration Transformer via Adaptive Translation Equivariance

JiaKui Hu; Zhengjian Yao; Lujia Jin; Hangzhou He; Yanye Lu

arXiv:2506.18520·cs.CV·June 24, 2025

Enhancing Image Restoration Transformer via Adaptive Translation Equivariance

JiaKui Hu, Zhengjian Yao, Lujia Jin, Hangzhou He, Yanye Lu

PDF

TL;DR

This paper introduces TEAFormer, a novel image restoration transformer that incorporates adaptive translation equivariance through slide indexing and component stacking, improving training, efficiency, and generalization.

Contribution

It proposes adaptive sliding indexing and component stacking strategies to preserve translation equivariance in transformers, addressing computational challenges and enhancing image restoration performance.

Findings

01

TEAFormer outperforms existing methods in various image restoration tasks.

02

The adaptive sliding indexing mechanism improves efficiency and effectiveness.

03

TEAFormer demonstrates better training convergence and generalization.

Abstract

Translation equivariance is a fundamental inductive bias in image restoration, ensuring that translated inputs produce translated outputs. Attention mechanisms in modern restoration transformers undermine this property, adversely impacting both training convergence and generalization. To alleviate this issue, we propose two key strategies for incorporating translation equivariance: slide indexing and component stacking. Slide indexing maintains operator responses at fixed positions, with sliding window attention being a notable example, while component stacking enables the arrangement of translation-equivariant operators in parallel or sequentially, thereby building complex architectures while preserving translation equivariance. However, these strategies still create a dilemma in model design between the high computational cost of self-attention and the fixed receptive field associated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.