Adaptive Low Rank Adaptation of Segment Anything to Salient Object   Detection

Ruikai Cui; Siyuan He; Shi Qiu

arXiv:2308.05426·cs.CV·August 11, 2023·1 cites

Adaptive Low Rank Adaptation of Segment Anything to Salient Object Detection

Ruikai Cui, Siyuan He, Shi Qiu

PDF

Open Access 1 Repo

TL;DR

This paper introduces SSOM, an adaptive method that fine-tunes the Segment Anything Model to improve salient object detection, leveraging low-rank structures for enhanced performance across multiple benchmarks.

Contribution

It presents a novel adaptive fine-tuning approach for SAM specifically tailored to salient object detection, utilizing low-rank structures for better accuracy.

Findings

01

Outperforms state-of-the-art methods on five RGB benchmark datasets

02

Demonstrates superior qualitative and quantitative results

03

Effectively adapts SAM for salient object detection

Abstract

Foundation models, such as OpenAI's GPT-3 and GPT-4, Meta's LLaMA, and Google's PaLM2, have revolutionized the field of artificial intelligence. A notable paradigm shift has been the advent of the Segment Anything Model (SAM), which has exhibited a remarkable capability to segment real-world objects, trained on 1 billion masks and 11 million images. Although SAM excels in general object segmentation, it lacks the intrinsic ability to detect salient objects, resulting in suboptimal performance in this domain. To address this challenge, we present the Segment Salient Object Model (SSOM), an innovative approach that adaptively fine-tunes SAM for salient object detection by harnessing the low-rank structure inherent in deep learning. Comprehensive qualitative and quantitative evaluations across five challenging RGB benchmark datasets demonstrate the superior performance of our approach,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CuiRuikai/SAM-SOD
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Virtual Reality Applications and Impacts · Face Recognition and Perception

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Segment Anything Model · Absolute Position Encodings · Label Smoothing · Linear Layer · Byte Pair Encoding · Attention Dropout · Cosine Annealing