Vision-Based End-to-End Learning for UAV Traversal of Irregular Gaps via Differentiable Simulation
Linzuo Zhang, Yu Hu, Feng Yu, Yang Deng, Wenxian Yu, Danping Zou

TL;DR
This paper introduces a vision-based end-to-end method for autonomous drones to traverse complex, irregular gaps using differentiable simulation and auxiliary modules, demonstrating robustness in simulation and real-world tests.
Contribution
The work presents a novel fully vision-based framework that directly maps depth images to control commands for gap traversal, incorporating differentiable simulation and auxiliary predictors.
Findings
Effective in simulation and real-world environments
Generalizes to unseen complex gaps
Enhances safety with auxiliary prediction modules
Abstract
-Navigation through narrow and irregular gaps is an essential skill in autonomous drones for applications such as inspection, search-and-rescue, and disaster response. However, traditional planning and control methods rely on explicit gap extraction and measurement, while recent end-to-end approaches often assume regularly shaped gaps, leading to poor generalization and limited practicality. In this work, we present a fully vision-based, end-to-end framework that maps depth images directly to control commands, enabling drones to traverse complex gaps within unseen environments. Operating in the Special Euclidean group SE(3), where position and orientation are tightly coupled, the framework leverages differentiable simulation, a Stop-Gradient operator, and a Bimodal Initialization Distribution to achieve stable traversal through consecutive gaps. Two auxiliary prediction modules-a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
