TL;DR
This paper introduces a biologically-inspired foveated object detector that mimics human vision by allocating resources based on visual acuity, achieving comparable detection performance to traditional methods with reduced computational costs.
Contribution
The paper proposes a novel foveated object detection approach combining human visual system principles with modern detectors, offering an efficient alternative to sliding window methods.
Findings
FOD performs comparably to sliding window detectors on PASCAL VOC 2007.
FOD achieves significant computational cost savings.
FOD effectively guides eye movements using retino-specific classifiers.
Abstract
We present a foveated object detector (FOD) as a biologically-inspired alternative to the sliding window (SW) approach which is the dominant method of search in computer vision object detection. Similar to the human visual system, the FOD has higher resolution at the fovea and lower resolution at the visual periphery. Consequently, more computational resources are allocated at the fovea and relatively fewer at the periphery. The FOD processes the entire scene, uses retino-specific object detection classifiers to guide eye movements, aligns its fovea with regions of interest in the input image and integrates observations across multiple fixations. Our approach combines modern object detectors from computer vision with a recent model of peripheral pooling regions found at the V1 layer of the human visual system. We assessed various eye movement strategies on the PASCAL VOC 2007 dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
