SalyPath360: Saliency and Scanpath Prediction Framework for Omnidirectional Images
Mohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Mohamed, Sayeh

TL;DR
This paper presents SalyPath360, a novel neural network framework that simultaneously predicts saliency maps and scanpaths for omnidirectional images, improving attention prediction accuracy.
Contribution
It introduces a combined encoder-decoder architecture with attention and auxiliary modules for joint saliency and scanpath prediction in 360-degree images.
Findings
Outperforms state-of-the-art methods on Salient360! dataset
Effectively predicts both saliency maps and scanpaths
Enhances understanding of visual attention in omnidirectional images
Abstract
This paper introduces a new framework to predict visual attention of omnidirectional images. The key setup of our architecture is the simultaneous prediction of the saliency map and a corresponding scanpath for a given stimulus. The framework implements a fully encoder-decoder convolutional neural network augmented by an attention module to generate representative saliency maps. In addition, an auxiliary network is employed to generate probable viewport center fixation points through the SoftArgMax function. The latter allows to derive fixation points from feature maps. To take advantage of the scanpath prediction, an adaptive joint probability distribution model is then applied to construct the final unbiased saliency map by leveraging the encoder decoder-based saliency map and the scanpath-based saliency heatmap. The proposed framework was evaluated in terms of saliency and scanpath…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Image and Video Quality Assessment · Advanced Image Fusion Techniques
MethodsMax Pooling · Average Pooling · Convolution · Sigmoid Activation · Communication--Guide||How Do I Communicate to Expedia?
