Concept Guided Co-salient Object Detection

Jiayi Zhu; Qing Guo; Felix Juefei-Xu; Yihao Huang; Yang Liu; Geguang Pu

arXiv:2412.16609·cs.CV·June 18, 2025

Concept Guided Co-salient Object Detection

Jiayi Zhu, Qing Guo, Felix Juefei-Xu, Yihao Huang, Yang Liu, Geguang Pu

PDF

Open Access

TL;DR

This paper introduces ConceptCoSOD, a novel framework that leverages high-level semantic concepts extracted from text to improve co-salient object detection across images, especially under challenging conditions.

Contribution

It proposes a concept-guided approach with a resampling strategy for better semantic prior extraction, significantly enhancing detection accuracy and robustness.

Findings

01

Outperforms existing methods on benchmark datasets

02

Achieves higher accuracy in challenging visual conditions

03

Demonstrates strong generalization across corrupted settings

Abstract

Co-salient object detection (Co-SOD) aims to identify common salient objects across a group of related images. While recent methods have made notable progress, they typically rely on low-level visual patterns and lack semantic priors, limiting their detection performance. We propose ConceptCoSOD, a concept-guided framework that introduces high-level semantic knowledge to enhance co-saliency detection. By extracting shared text-based concepts from the input image group, ConceptCoSOD provides semantic guidance that anchors the detection process. To further improve concept quality, we analyze the effect of diffusion timesteps and design a resampling strategy that selects more informative steps for learning robust concepts. This semantic prior, combined with the resampling-enhanced representation, enables accurate and consistent segmentation even in challenging visual conditions. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection

MethodsFocus