Salient Object Detection From Arbitrary Modalities
Nianchang Huang, Yang Yang, Ruida Xi, Qiang Zhang, Jungong Han, Jin, Huang

TL;DR
This paper introduces Arbitrary Modality Salient Object Detection (AM SOD), a new task that handles varying input types and numbers, proposing a modality switch network and a new dataset to improve robustness across diverse modalities.
Contribution
The paper proposes a novel AM SOD task, a modality switch network with feature extraction and dynamic fusion, and introduces the AM-XD dataset for research on arbitrary modality inputs.
Findings
The proposed method effectively handles different modality types and numbers.
Experimental results show robustness in diverse real-world scenarios.
The new dataset facilitates future research in AM SOD.
Abstract
Toward desirable saliency prediction, the types and numbers of inputs for a salient object detection (SOD) algorithm may dynamically change in many real-life applications. However, existing SOD algorithms are mainly designed or trained for one particular type of inputs, failing to be generalized to other types of inputs. Consequentially, more types of SOD algorithms need to be prepared in advance for handling different types of inputs, raising huge hardware and research costs. Differently, in this paper, we propose a new type of SOD task, termed Arbitrary Modality SOD (AM SOD). The most prominent characteristics of AM SOD are that the modality types and modality numbers will be arbitrary or dynamically changed. The former means that the inputs to the AM SOD algorithm may be arbitrary modalities such as RGB, depths, or even any combination of them. While, the latter indicates that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection
MethodsAttention Is All You Need · Dense Connections · Dropout · Label Smoothing · Residual Connection · Softmax · Attention Model · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Absolute Position Encodings
