SalGAN: Visual Saliency Prediction with Generative Adversarial Networks

Junting Pan; Cristian Canton Ferrer; Kevin McGuinness; Noel E.; O'Connor; Jordi Torres; Elisa Sayrol; Xavier Giro-i-Nieto

arXiv:1701.01081·cs.CV·July 3, 2018·337 cites

SalGAN: Visual Saliency Prediction with Generative Adversarial Networks

Junting Pan, Cristian Canton Ferrer, Kevin McGuinness, Noel E., O'Connor, Jordi Torres, Elisa Sayrol, Xavier Giro-i-Nieto

PDF

Open Access 4 Repos

TL;DR

SalGAN is a deep learning model that uses adversarial training to improve the accuracy of visual saliency prediction, achieving state-of-the-art results across multiple metrics.

Contribution

This paper introduces SalGAN, the first to apply adversarial training with GANs to enhance visual saliency prediction accuracy.

Findings

01

Achieves state-of-the-art performance on saliency benchmarks

02

Adversarial training improves prediction quality

03

Source code and models are publicly available

Abstract

We introduce SalGAN, a deep convolutional neural network for visual saliency prediction trained with adversarial examples. The first stage of the network consists of a generator model whose weights are learned by back-propagation computed from a binary cross entropy (BCE) loss over downsampled versions of the saliency maps. The resulting prediction is processed by a discriminator network trained to solve a binary classification task between the saliency maps generated by the generative stage and the ground truth ones. Our experiments show how adversarial training allows reaching state-of-the-art performance across different metrics when combined with a widely-used loss function like BCE. Our results can be reproduced with the source code and trained models available at https://imatge-upc.github.io/saliency-salgan-2017/.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Image and Video Quality Assessment · Visual perception and processing mechanisms