Dynamic Open Vocabulary Enhanced Safe-landing with Intelligence (DOVESEI)
Haechan Mark Bong, Rongge Zhang, Ricardo de Azambuja, and Giovanni Beltrame

TL;DR
This paper introduces DOVESEI, a reactive UAV system that uses open vocabulary image segmentation and a dynamic focus mechanism for safe landing. Operating from a starting altitude of 100 meters, it executes landing maneuvers at altitudes as low as 20 meters, and the dynamic focus raises the landing success rate nearly tenfold.
Contribution
The work combines open vocabulary segmentation with a dynamic focus mechanism for safe UAV landings. Because the vocabulary is open, the system adapts to new scenarios with minimal adjustments and avoids the extensive data collection otherwise needed to refine internal models.
Findings
Successful landing at altitudes as low as 20 meters.
Nearly tenfold increase in landing success rate with the dynamic focus.
Open source implementation available online.
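The summary names the dynamic focus but does not say how it is computed. As a rough illustration only, and not the authors' implementation, the sketch below assumes the focus is a circular mask centered in the image, sized by projecting a hypothetical safety radius onto the ground through a pinhole model of a downward-facing camera, and then intersected with a binary "safe to land" segmentation mask. All function names, the 2-meter radius, the 640-pixel width, and the 90-degree field of view are assumptions made for the example.

```python
import math


def projected_radius_px(safety_radius_m, altitude_m, image_width_px, hfov_deg):
    """Pixel radius of a ground circle seen by a downward-facing pinhole camera.

    Assumption: flat ground and a camera looking straight down.
    """
    focal_px = (image_width_px / 2) / math.tan(math.radians(hfov_deg) / 2)
    return focal_px * safety_radius_m / altitude_m


def focus_mask(height, width, center, radius_px):
    """Boolean grid that is True inside a circle of radius_px around center (row, col)."""
    cy, cx = center
    r2 = radius_px ** 2
    return [
        [(y - cy) ** 2 + (x - cx) ** 2 <= r2 for x in range(width)]
        for y in range(height)
    ]


def apply_focus(safe_mask, focus):
    """Keep only the 'safe' pixels that also fall inside the focus region."""
    return [
        [s and f for s, f in zip(safe_row, focus_row)]
        for safe_row, focus_row in zip(safe_mask, focus)
    ]


# Hypothetical camera and safety radius, evaluated at the two altitudes
# mentioned in the summary (100 m and 20 m).
radius_high = projected_radius_px(2.0, 100.0, 640, 90.0)
radius_low = projected_radius_px(2.0, 20.0, 640, 90.0)

# Tiny 5x5 example: every pixel is segmented as safe, focus radius is 1 px.
all_safe = [[True] * 5 for _ in range(5)]
focused = apply_focus(all_safe, focus_mask(5, 5, (2, 2), 1))
```

Under these assumed parameters the projected radius is about 6.4 pixels at 100 meters and about 32 pixels at 20 meters, so the same ground footprint covers more of the image as the vehicle descends. Restricting the segmentation to such a region is one plausible way to ignore fluctuations far from the landing spot.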
Abstract
This work targets what we consider to be the foundational step for urban airborne robots, a safe landing. Our attention is directed toward what we deem the most crucial aspect of the safe landing perception stack: segmentation. We present a streamlined reactive UAV system that employs visual servoing by harnessing the capabilities of open vocabulary image segmentation. This approach can adapt to various scenarios with minimal adjustments, bypassing the necessity for extensive data accumulation for refining internal models, thanks to its open vocabulary methodology. Given the limitations imposed by local authorities, our primary focus centers on operations originating from altitudes of 100 meters. This choice is deliberate, as numerous preceding works have dealt with altitudes up to 30 meters, aligning with the capabilities of small stereo cameras. Consequently, we leave the remaining…
Taxonomy
Topics: Robotics and Sensor-Based Localization · Robotic Path Planning Algorithms · Advanced Vision and Imaging
Methods: Focus
