LLM-Land: Large Language Models for Context-Aware Drone Landing
Siwei Cai, Yuwei Wu, Lifeng Zhou

TL;DR
This paper introduces LLM-Land, a hybrid system combining large language models with model predictive control to enable context-aware, safe, and precise autonomous drone landings in complex environments.
Contribution
It presents a novel framework integrating vision-language encoding, LLM-based scene understanding, and MPC for improved drone landing safety and accuracy in unstructured settings.
Findings
Reduces near-miss incidents with dynamic obstacles.
Maintains high landing accuracy in cluttered environments.
Outperforms traditional vision-based MPC approaches.
Abstract
Autonomous landing is essential for drones deployed in emergency deliveries, post-disaster response, and other large-scale missions. By enabling self-docking on charging platforms, it facilitates continuous operation and significantly extends mission endurance. However, traditional approaches often fall short in dynamic, unstructured environments due to limited semantic awareness and reliance on fixed, context-insensitive safety margins. To address these limitations, we propose a hybrid framework that integrates large language model (LLMs) with model predictive control (MPC). Our approach begins with a vision-language encoder (VLE) (e.g., BLIP), which transforms real-time images into concise textual scene descriptions. These descriptions are processed by a lightweight LLM (e.g., Qwen 2.5 1.5B or LLaMA 3.2 1B) equipped with retrieval-augmented generation (RAG) to classify scene elements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · UAV Applications and Optimization · Robotics and Sensor-Based Localization
