FastSal: a Computationally Efficient Network for Visual Saliency   Prediction

Feiyan Hu; Kevin McGuinness

arXiv:2008.11151·cs.CV·August 26, 2020

FastSal: a Computationally Efficient Network for Visual Saliency Prediction

Feiyan Hu, Kevin McGuinness

PDF

Open Access 1 Repo

TL;DR

FastSal introduces a computationally efficient neural network for visual saliency prediction, leveraging MobileNetV2 and knowledge transfer techniques to achieve competitive accuracy with reduced complexity.

Contribution

The paper demonstrates that MobileNetV2 can serve as an effective backbone for saliency prediction and shows how pseudo-labeling from complex models enhances efficiency.

Findings

01

MobileNetV2 outperforms other architectures as a backbone.

02

Pseudo-labeling from DeepGaze II improves performance.

03

FastSal achieves comparable accuracy with less computational cost.

Abstract

This paper focuses on the problem of visual saliency prediction, predicting regions of an image that tend to attract human visual attention, under a constrained computational budget. We modify and test various recent efficient convolutional neural network architectures like EfficientNet and MobileNetV2 and compare them with existing state-of-the-art saliency models such as SalGAN and DeepGaze II both in terms of standard accuracy metrics like AUC and NSS, and in terms of the computational complexity and model size. We find that MobileNetV2 makes an excellent backbone for a visual saliency model and can be effective even without a complex decoder. We also show that knowledge transfer from a more computationally expensive model like DeepGaze II can be achieved via pseudo-labelling an unlabelled dataset, and that this approach gives result on-par with many state-of-the-art algorithms with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

feiyanhu/FastSal
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Image and Video Quality Assessment · Image Enhancement Techniques

MethodsRMSProp · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Inverted Residual Block · Sigmoid Activation · Average Pooling · Convolution