Visual Attention Prediction Improves Performance of Autonomous Drone Racing Agents
Christian Pfeiffer, Simon Wengeler, Antonio Loquercio, Davide, Scaramuzza

TL;DR
This study shows that incorporating human-like visual attention prediction into neural networks significantly enhances the performance of autonomous drone racing agents, surpassing traditional image-based methods.
Contribution
The paper introduces a novel attention prediction model based on human gaze data to improve neural network control in drone racing tasks.
Findings
Attention-prediction controller achieved 88% success rate in racing
Outperformed raw image and feature-track baselines
Showed better generalization on hold-out trajectories
Abstract
Humans race drones faster than neural networks trained for end-to-end autonomous flight. This may be related to the ability of human pilots to select task-relevant visual information effectively. This work investigates whether neural networks capable of imitating human eye gaze behavior and attention can improve neural network performance for the challenging task of vision-based autonomous drone racing. We hypothesize that gaze-based attention prediction can be an efficient mechanism for visual information selection and decision making in a simulator-based drone racing task. We test this hypothesis using eye gaze and flight trajectory data from 18 human drone pilots to train a visual attention prediction model. We then use this visual attention prediction model to train an end-to-end controller for vision-based autonomous drone racing using imitation learning. We compare the drone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
