DualGazeNet: A Biologically Inspired Dual-Gaze Query Network for Salient Object Detection
Yu Zhang, Haoan Ping, Yuchen Li, Zhenshan Bing, Fuchun Sun, Alois Knoll

TL;DR
DualGazeNet is a simple, biologically inspired Transformer-based model for salient object detection that outperforms complex architectures in accuracy, speed, and efficiency, inspired by human visual processing.
Contribution
This work introduces DualGazeNet, a novel biologically inspired Transformer framework that models dual-pathway processing and attention, achieving state-of-the-art results with reduced complexity.
Findings
Surpasses 25 state-of-the-art methods in RGB SOD benchmarks.
Achieves 60% higher inference speed and 53.4% fewer FLOPs than similar Transformer baselines.
Demonstrates strong cross-domain generalization on camouflaged and underwater SOD tasks.
Abstract
Recent salient object detection (SOD) methods aim to improve performance in four key directions: semantic enhancement, boundary refinement, auxiliary task supervision, and multi-modal fusion. In pursuit of continuous gains, these approaches have evolved toward increasingly sophisticated architectures with multi-stage pipelines, specialized fusion modules, edge-guided learning, and elaborate attention mechanisms. However, this complexity paradoxically introduces feature redundancy and cross-component interference that obscure salient cues, ultimately reaching performance bottlenecks. In contrast, human vision achieves efficient salient object identification without such architectural complexity. This contrast raises a fundamental question: can we design a biologically grounded yet architecturally simple SOD framework that dispenses with most of this engineering complexity, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Face Recognition and Perception
