Multimodal Large Language Model Enabled Robust Beamforming for HAP Downlink Communications
Xiaoyu Xing, Peng Yang, Guoquan Tao, Dingyi Lu, Zehui Xiong, Xianbin Cao

TL;DR
This paper introduces a multimodal large language model framework that enhances the robustness of HAP downlink beamforming by forecasting attitude deviations and proactively adjusting beam directions, improving communication reliability and efficiency.
Contribution
The work develops a vision-language LLM for attitude prediction, a calibration procedure for forecast errors, and a QoS-driven beamforming method, advancing robust HAP communication strategies.
Findings
Achieves 22.1% higher user service ratio
Achieves 12.5% higher sum-rate
Supports practical delay-aware deployment with mean latency 36.24 ms
Abstract
Small changes in high altitude platform (HAP) attitude can cause significant deviations in HAP downlink beam directions, thereby severely degrading HAP downlink communication performance. In this paper, we develop a multimodal large language model (LLM) enabled beamforming framework to achieve robust HAP downlink communications.Specifically, we design a vision-language LLM (VL-LLM) that learns from multivariate flight telemetry to forecast short-term HAP attitudes under platform shaking and support delay-aware proactive beam steering.We design an offline forecast-error calibration procedure to obtain upper bounds on forecast errors and improve the reliability of proactive analog beam steering.Based on the attitude forecasts, we proactively update the analog beamformer and propose a QoS-driven beamforming and admission method with a lightweight feasibility-enforcement step to satisfy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
