View-aware Salient Object Detection for 360{\deg} Omnidirectional Image
Junjie Wu, Changqun Xia, Tianshu Yu, Jia Li

TL;DR
This paper introduces a large-scale 360-degree omnidirectional image dataset for salient object detection, and proposes a view-aware detection method that effectively handles panoramic distortions, discontinuities, and scale variations.
Contribution
The paper creates the largest 360-degree ISOD dataset with pixel-wise annotations and proposes a novel view-aware detection method using a Sample Adaptive View Transformer.
Findings
The dataset is the largest and most challenging for 360-degree ISOD.
The proposed method outperforms 20 state-of-the-art ISOD techniques.
Experiments demonstrate the effectiveness of view-aware features in panoramic scenarios.
Abstract
Image-based salient object detection (ISOD) in 360{\deg} scenarios is significant for understanding and applying panoramic information. However, research on 360{\deg} ISOD has not been widely explored due to the lack of large, complex, high-resolution, and well-labeled datasets. Towards this end, we construct a large scale 360{\deg} ISOD dataset with object-level pixel-wise annotation on equirectangular projection (ERP), which contains rich panoramic scenes with not less than 2K resolution and is the largest dataset for 360{\deg} ISOD by far to our best knowledge. By observing the data, we find current methods face three significant challenges in panoramic scenarios: diverse distortion degrees, discontinuous edge effects and changeable object scales. Inspired by humans' observing process, we propose a view-aware salient object detection method based on a Sample Adaptive View Transformer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Softmax · Dropout · Dense Connections · Residual Connection · Absolute Position Encodings · Position-Wise Feed-Forward Layer
