AirHunt: Bridging VLM Semantics and Continuous Planning for Efficient Aerial Object Navigation
Xuecheng Chen, Zongzhuo Liu, Jianfa Ma, Bang Du, Tiantian Zhang, Xueqian Wang, and Boyu Zhou

TL;DR
AirHunt is a novel aerial navigation system that combines large Vision-Language Models with continuous planning to efficiently locate objects in outdoor environments, adapting semantic guidance dynamically for improved success and efficiency.
Contribution
This work introduces a dual-pathway asynchronous architecture and active reasoning modules that effectively integrate VLM semantics with real-time path planning for drone navigation.
Findings
Higher success rate in object navigation tasks
Lower navigation error compared to state-of-the-art methods
Reduced flight time in diverse environments
Abstract
Recent advances in large Vision-Language Models (VLMs) have provided rich semantic understanding that empowers drones to search for open-set objects via natural language instructions. However, prior systems struggle to integrate VLMs into practical aerial systems due to orders-of-magnitude frequency mismatch between VLM inference and real-time planning, as well as VLMs' limited 3D scene understanding. They also lack a unified mechanism to balance semantic guidance with motion efficiency in large-scale environments. To address these challenges, we present AirHunt, an aerial object navigation system that efficiently locates open-set objects with zero-shot generalization in outdoor environments by seamlessly fusing VLM semantic reasoning with continuous path planning. AirHunt features a dual-pathway asynchronous architecture that establishes a synergistic interface between VLM reasoning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robotic Path Planning Algorithms · Robotics and Sensor-Based Localization
