LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection

Zhengyi Liu; Longzhen Wang; Xianyong Fang; Zhengzheng Tu; Linbo Wang

arXiv:2411.06652·cs.CV·November 12, 2024

LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection

Zhengyi Liu, Longzhen Wang, Xianyong Fang, Zhengzheng Tu, Linbo Wang

PDF

Open Access 1 Repo

TL;DR

LFSamba introduces a novel light field salient object detection model that combines SAM and Mamba for efficient feature extraction, multi-modal and inter-slice relation modeling, and weakly supervised learning, advancing the state-of-the-art in light field analysis.

Contribution

The paper presents a new model LFSamba that integrates SAM and Mamba to improve multi-focus light field salient object detection with weak supervision.

Findings

01

Effective feature extraction with SAM

02

Long-range inter-slice dependency modeling with Mamba

03

First scribble-supervised baseline for light field detection

Abstract

A light field camera can reconstruct 3D scenes using captured multi-focus images that contain rich spatial geometric information, enhancing applications in stereoscopic photography, virtual reality, and robotic vision. In this work, a state-of-the-art salient object detection model for multi-focus light field images, called LFSamba, is introduced to emphasize four main insights: (a) Efficient feature extraction, where SAM is used to extract modality-aware discriminative features; (b) Inter-slice relation modeling, leveraging Mamba to capture long-range dependencies across multiple focal slices, thus extracting implicit depth cues; (c) Inter-modal relation modeling, utilizing Mamba to integrate all-focus and multi-focus images, enabling mutual enhancement; (d) Weakly supervised learning capability, developing a scribble annotation dataset from an existing pixel-level mask dataset,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liuzywen/lfscribble
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInfrared Target Detection Methodologies

MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces · Segment Anything Model