Less is More: Skim Transformer for Light Field Image Super-resolution
Zeke Zexi Hu, Haodong Chen, Hui Ye, Xiaoming Chen, Vera Yuk Ying Chung, Yiran Shen, Weidong Cai

TL;DR
This paper introduces the Skim Transformer and SkimLFSR, a lightweight, disparity-aware network for light field image super-resolution that achieves state-of-the-art results with fewer parameters by selectively attending to relevant sub-aperture images.
Contribution
The paper proposes a novel multi-branch Skim Transformer architecture that reduces redundancy and improves efficiency in light field super-resolution by focusing on specific disparity ranges.
Findings
Achieves state-of-the-art super-resolution performance with only 67% of previous parameters.
Demonstrates effective disparity-aware attention behavior guided by predefined SAI subsets.
Shows strong generalizability across different angular resolutions without retraining.
Abstract
A light field image captures scenes through its micro-lens array, providing a rich representation that encompasses spatial and angular information. While this richness comes at significant data redundancy, most existing methods tend to indiscriminately utilize all the information from sub-aperture images (SAIs) in an attempt to harness every visual cue regardless of their disparity significance. However, this paradigm inevitably leads to disparity entanglement, a fundamental cause of inefficiency in light field image processing. To address this limitation, we introduce the Skim Transformer, a novel architecture inspired by the "less is more" philosophy. It features a multi-branch structure where each branch is dedicated to a specific disparity range by constructing its attention score matrix over a skimmed subset of SAIs, rather than all of them. Building upon it, we present SkimLFSR,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Photoacoustic and Ultrasonic Imaging · Optical Coherence Tomography Applications
MethodsAttention Is All You Need · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Multi-Head Attention · Dense Connections
