Energy-Based Generative Cooperative Saliency Prediction
Jing Zhang, Jianwen Xie, Zilong Zheng, Nick Barnes

TL;DR
This paper introduces a generative cooperative framework combining a latent variable model and an energy-based model for saliency prediction, capturing uncertainty and producing diverse, high-quality saliency maps.
Contribution
It proposes a novel coarse-to-fine cooperative approach with energy-based refinement, including a weakly supervised learning strategy and a refinement module for existing models.
Findings
Achieves state-of-the-art results in fully supervised saliency prediction.
Effectively models uncertainty and generates diverse saliency maps.
Improves weakly supervised saliency prediction performance.
Abstract
Conventional saliency prediction models typically learn a deterministic mapping from an image to its saliency map, and thus fail to explain the subjective nature of human attention. In this paper, to model the uncertainty of visual saliency, we study the saliency prediction problem from the perspective of generative models by learning a conditional probability distribution over the saliency map given an input image, and treating the saliency prediction as a sampling process from the learned distribution. Specifically, we propose a generative cooperative saliency prediction framework, where a conditional latent variable model (LVM) and a conditional energy-based model (EBM) are jointly trained to predict salient objects in a cooperative manner. The LVM serves as a fast but coarse predictor to efficiently produce an initial saliency map, which is then refined by the iterative Langevin…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Face Recognition and Perception · Aesthetic Perception and Analysis
Methodsenergy-based model
