FastSal: a Computationally Efficient Network for Visual Saliency Prediction
Feiyan Hu, Kevin McGuinness

TL;DR
FastSal introduces a computationally efficient neural network for visual saliency prediction, leveraging MobileNetV2 and knowledge transfer techniques to achieve competitive accuracy with reduced complexity.
Contribution
The paper demonstrates that MobileNetV2 can serve as an effective backbone for saliency prediction and shows how pseudo-labeling from complex models enhances efficiency.
Findings
MobileNetV2 outperforms other architectures as a backbone.
Pseudo-labeling from DeepGaze II improves performance.
FastSal achieves comparable accuracy with less computational cost.
Abstract
This paper focuses on the problem of visual saliency prediction, predicting regions of an image that tend to attract human visual attention, under a constrained computational budget. We modify and test various recent efficient convolutional neural network architectures like EfficientNet and MobileNetV2 and compare them with existing state-of-the-art saliency models such as SalGAN and DeepGaze II both in terms of standard accuracy metrics like AUC and NSS, and in terms of the computational complexity and model size. We find that MobileNetV2 makes an excellent backbone for a visual saliency model and can be effective even without a complex decoder. We also show that knowledge transfer from a more computationally expensive model like DeepGaze II can be achieved via pseudo-labelling an unlabelled dataset, and that this approach gives result on-par with many state-of-the-art algorithms with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Image and Video Quality Assessment · Image Enhancement Techniques
MethodsRMSProp · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Inverted Residual Block · Sigmoid Activation · Average Pooling · Convolution
