LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection
Zhengyi Liu, Longzhen Wang, Xianyong Fang, Zhengzheng Tu, Linbo Wang

TL;DR
LFSamba introduces a novel light field salient object detection model that combines SAM and Mamba for efficient feature extraction, multi-modal and inter-slice relation modeling, and weakly supervised learning, advancing the state-of-the-art in light field analysis.
Contribution
The paper presents a new model LFSamba that integrates SAM and Mamba to improve multi-focus light field salient object detection with weak supervision.
Findings
Effective feature extraction with SAM
Long-range inter-slice dependency modeling with Mamba
First scribble-supervised baseline for light field detection
Abstract
A light field camera can reconstruct 3D scenes using captured multi-focus images that contain rich spatial geometric information, enhancing applications in stereoscopic photography, virtual reality, and robotic vision. In this work, a state-of-the-art salient object detection model for multi-focus light field images, called LFSamba, is introduced to emphasize four main insights: (a) Efficient feature extraction, where SAM is used to extract modality-aware discriminative features; (b) Inter-slice relation modeling, leveraging Mamba to capture long-range dependencies across multiple focal slices, thus extracting implicit depth cues; (c) Inter-modal relation modeling, utilizing Mamba to integrate all-focus and multi-focus images, enabling mutual enhancement; (d) Weakly supervised learning capability, developing a scribble annotation dataset from an existing pixel-level mask dataset,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces · Segment Anything Model
