Ventral-Dorsal Neural Networks: Object Detection via Selective Attention
Mohammad K. Ebrahimpour, Jiayun Li, Yen-Yun Yu, Jackson L. Reese,, Azadeh Moghtaderi, Ming-Hsuan Yang, David C. Noelle

TL;DR
This paper introduces Ventral-Dorsal Networks (VDNets), a biologically inspired dual neural pathway framework that enhances object detection by mimicking human visual processing, leading to significant performance improvements on standard datasets.
Contribution
The paper presents a novel dual network architecture inspired by the human visual system, integrating object recognition and localization streams for improved detection accuracy.
Findings
Outperforms state-of-the-art on PASCAL VOC 2007 by 8% mAP
Achieves 3% higher mAP on PASCAL VOC 2012
Demonstrates qualitative and quantitative benefits on Yearbook images
Abstract
Deep Convolutional Neural Networks (CNNs) have been repeatedly proven to perform well on image classification tasks. Object detection methods, however, are still in need of significant improvements. In this paper, we propose a new framework called Ventral-Dorsal Networks (VDNets) which is inspired by the structure of the human visual system. Roughly, the visual input signal is analyzed along two separate neural streams, one in the temporal lobe and the other in the parietal lobe. The coarse functional distinction between these streams is between object recognition -- the "what" of the signal -- and extracting location related information -- the "where" of the signal. The ventral pathway from primary visual cortex, entering the temporal lobe, is dominated by "what" information, while the dorsal pathway, into the parietal lobe, is dominated by "where" information. Inspired by this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Industrial Vision Systems and Defect Detection
