A Fixation-based 360{\deg} Benchmark Dataset for Salient Object Detection
Yi Zhang, Lu Zhang, Wassim Hamidouche, Olivier Deforges

TL;DR
This paper introduces a new 360-degree image dataset with pixel-level annotations for salient object detection, highlighting the challenges of applying current models to panoramic images and providing a benchmark for future research.
Contribution
The authors created and annotated a comprehensive 360-degree dataset for salient object detection, filling a gap in available resources for panoramic image analysis.
Findings
Current SOD models perform poorly on panoramic images.
The dataset reveals limitations of existing methods in 360-degree SOD.
Benchmark results highlight the need for specialized models for panoramic scenes.
Abstract
Fixation prediction (FP) in panoramic contents has been widely investigated along with the booming trend of virtual reality (VR) applications. However, another issue within the field of visual saliency, salient object detection (SOD), has been seldom explored in 360{\deg} (or omnidirectional) images due to the lack of datasets representative of real scenes with pixel-level annotations. Toward this end, we collect 107 equirectangular panoramas with challenging scenes and multiple object classes. Based on the consistency between FP and explicit saliency judgements, we further manually annotate 1,165 salient objects over the collected images with precise masks under the guidance of real human eye fixation maps. Six state-of-the-art SOD models are then benchmarked on the proposed fixation-based 360{\deg} image dataset (F-360iSOD), by applying a multiple cubic projection-based fine-tuning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Olfactory and Sensory Function Studies · Advanced Image and Video Retrieval Techniques
