Active Gaze Control for Foveal Scene Exploration
Alexandre M.F. Dias, Lu\'is Sim\~oes, Plinio Moreno, Alexandre, Bernardino

TL;DR
This paper presents a method for active gaze control in scene exploration that emulates human visual behavior, using deep learning and information theory to efficiently identify objects with minimal gaze shifts.
Contribution
The paper introduces a novel approach combining calibrated deep object detection, semantic mapping, and information-theoretic gaze planning for foveal scene exploration.
Findings
Increases detection F1-score by 2-3 percentage points over random gaze shifts.
Reduces the number of gaze shifts to one third to achieve similar detection performance.
Demonstrates effective scene exploration with fewer gaze movements.
Abstract
Active perception and foveal vision are the foundations of the human visual system. While foveal vision reduces the amount of information to process during a gaze fixation, active perception will change the gaze direction to the most promising parts of the visual field. We propose a methodology to emulate how humans and robots with foveal cameras would explore a scene, identifying the objects present in their surroundings with in least number of gaze shifts. Our approach is based on three key methods. First, we take an off-the-shelf deep object detector, pre-trained on a large dataset of regular images, and calibrate the classification outputs to the case of foveated images. Second, a body-centered semantic map, encoding the objects classifications and corresponding uncertainties, is sequentially updated with the calibrated detections, considering several data fusion techniques. Third,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Visual Attention and Saliency Detection · Image Processing Techniques and Applications
