Bi-directional Object-context Prioritization Learning for Saliency Ranking
Xin Tian, Ke Xu, Xin Yang, Lin Du, Baocai Yin, Rynson W.H. Lau

TL;DR
This paper introduces a bi-directional model for saliency ranking that combines object-based and spatial attention mechanisms, inspired by human visual recognition, to improve the accuracy of identifying and ranking salient objects in scenes.
Contribution
It proposes a novel bi-directional framework with SOS and OCOR modules to unify spatial and object-based attention for more realistic saliency ranking.
Findings
Outperforms existing state-of-the-art methods.
Effectively models object-context interactions.
Enhances saliency ranking accuracy.
Abstract
The saliency ranking task is recently proposed to study the visual behavior that humans would typically shift their attention over different objects of a scene based on their degrees of saliency. Existing approaches focus on learning either object-object or object-scene relations. Such a strategy follows the idea of object-based attention in Psychology, but it tends to favor those objects with strong semantics (e.g., humans), resulting in unrealistic saliency ranking. We observe that spatial attention works concurrently with object-based attention in the human visual recognition system. During the recognition process, the human spatial attention mechanism would move, engage, and disengage from region to region (i.e., context to context). This inspires us to model the region-level interactions, in addition to the object-level reasoning, for saliency ranking. To this end, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Face Recognition and Perception
