TL;DR
This paper introduces a Transformer-based approach for light field image super-resolution, effectively capturing both angular and spatial dependencies to improve image quality with low computational cost.
Contribution
The paper proposes a novel angular and spatial Transformer framework for light field super-resolution, surpassing CNN-based methods in performance and efficiency.
Findings
Achieves superior super-resolution results on five public datasets.
Uses a small model size with low computational cost.
Demonstrates effectiveness through extensive ablation studies.
Abstract
Light field (LF) image super-resolution (SR) aims at reconstructing high-resolution LF images from their low-resolution counterparts. Although CNN-based methods have achieved remarkable performance in LF image SR, these methods cannot fully model the non-local properties of the 4D LF data. In this paper, we propose a simple but effective Transformer-based method for LF image SR. In our method, an angular Transformer is designed to incorporate complementary information among different views, and a spatial Transformer is developed to capture both local and long-range dependencies within each sub-aperture image. With the proposed angular and spatial Transformers, the beneficial information in an LF can be fully exploited and the SR performance is boosted. We validate the effectiveness of our angular and spatial Transformers through extensive ablation studies, and compare our method to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Multi-Head Attention · Softmax · Layer Normalization · Label Smoothing · Spatial Transformer
