Detail-Preserving Transformer for Light Field Image Super-Resolution
Shunzhou Wang, Tianfei Zhou, Yao Lu, Huijun Di

TL;DR
This paper introduces a Transformer-based model for light field image super-resolution that effectively captures global and local dependencies, leading to improved detail recovery and superior performance over existing methods.
Contribution
It proposes a novel detail-preserving Transformer (DPT) that models long-range dependencies in light field sequences and leverages gradient maps for enhanced detail restoration.
Findings
Achieves superior super-resolution performance on multiple light field datasets.
Effectively models global relations and local details using the proposed Transformer architecture.
Outperforms state-of-the-art light field super-resolution methods.
Abstract
Recently, numerous algorithms have been developed to tackle the problem of light field super-resolution (LFSR), i.e., super-resolving low-resolution light fields to gain high-resolution views. Despite delivering encouraging results, these approaches are all convolution-based, and are naturally weak in global relation modeling of sub-aperture images necessarily to characterize the inherent structure of light fields. In this paper, we put forth a novel formulation built upon Transformers, by treating LFSR as a sequence-to-sequence reconstruction task. In particular, our model regards sub-aperture images of each vertical or horizontal angular view as a sequence, and establishes long-range geometric dependencies within each sequence via a spatial-angular locally-enhanced self-attention layer, which maintains the locality of each sub-aperture image as well. Additionally, to better recover…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Advanced Fluorescence Microscopy Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Convolution · Label Smoothing · Six Ways To Communicate To Someone At Expedia Via Phone And Email's. · Absolute Position Encodings · Residual Connection · Softmax · Adam
