AeroDuo: Aerial Duo for UAV-based Vision and Language Navigation
Ruipu Wu, Yige Zhang, Jinyu Chen, Linjiang Huang, Shifeng Zhang, Xu Zhou, Liang Wang, and Si Liu

TL;DR
AeroDuo introduces a dual-UAV system with high and low-altitude drones collaborating for vision and language navigation, supported by a new dataset and a framework that leverages multi-level reasoning and efficient communication.
Contribution
The paper proposes a novel dual-UAV collaborative navigation task, a new dataset HaL-13k, and an integrated framework combining large language models and lightweight policies for improved UAV navigation.
Findings
Successful demonstration of dual-UAV collaboration in complex environments.
The HaL-13k dataset enables systematic evaluation of UAV-VLN models.
AeroDuo achieves better navigation accuracy with minimal communication.
Abstract
Aerial Vision-and-Language Navigation (VLN) is an emerging task that enables Unmanned Aerial Vehicles (UAVs) to navigate outdoor environments using natural language instructions and visual cues. However, due to the extended trajectories and complex maneuverability of UAVs, achieving reliable UAV-VLN performance is challenging and often requires human intervention or overly detailed instructions. To harness the advantages of UAVs' high mobility, which could provide multi-grained perspectives, while maintaining a manageable motion space for learning, we introduce a novel task called Dual-Altitude UAV Collaborative VLN (DuAl-VLN). In this task, two UAVs operate at distinct altitudes: a high-altitude UAV responsible for broad environmental reasoning, and a low-altitude UAV tasked with precise navigation. To support the training and evaluation of the DuAl-VLN, we construct the HaL-13k, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · UAV Applications and Optimization
