Concept Guided Co-salient Object Detection
Jiayi Zhu, Qing Guo, Felix Juefei-Xu, Yihao Huang, Yang Liu, Geguang Pu

TL;DR
This paper introduces ConceptCoSOD, a novel framework that leverages high-level semantic concepts extracted from text to improve co-salient object detection across images, especially under challenging conditions.
Contribution
It proposes a concept-guided approach with a resampling strategy for better semantic prior extraction, significantly enhancing detection accuracy and robustness.
Findings
Outperforms existing methods on benchmark datasets
Achieves higher accuracy in challenging visual conditions
Demonstrates strong generalization across corrupted settings
Abstract
Co-salient object detection (Co-SOD) aims to identify common salient objects across a group of related images. While recent methods have made notable progress, they typically rely on low-level visual patterns and lack semantic priors, limiting their detection performance. We propose ConceptCoSOD, a concept-guided framework that introduces high-level semantic knowledge to enhance co-saliency detection. By extracting shared text-based concepts from the input image group, ConceptCoSOD provides semantic guidance that anchors the detection process. To further improve concept quality, we analyze the effect of diffusion timesteps and design a resampling strategy that selects more informative steps for learning robust concepts. This semantic prior, combined with the resampling-enhanced representation, enables accurate and consistent segmentation even in challenging visual conditions. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection
MethodsFocus
