Multimodal Large Language Model Enabled Robust Beamforming for HAP Downlink Communications

Xiaoyu Xing; Peng Yang; Guoquan Tao; Dingyi Lu; Zehui Xiong; Xianbin Cao

arXiv:2604.09017·cs.NI·April 13, 2026

Multimodal Large Language Model Enabled Robust Beamforming for HAP Downlink Communications

Xiaoyu Xing, Peng Yang, Guoquan Tao, Dingyi Lu, Zehui Xiong, Xianbin Cao

PDF

TL;DR

This paper introduces a multimodal large language model framework that enhances the robustness of HAP downlink beamforming by forecasting attitude deviations and proactively adjusting beam directions, improving communication reliability and efficiency.

Contribution

The work develops a vision-language LLM for attitude prediction, a calibration procedure for forecast errors, and a QoS-driven beamforming method, advancing robust HAP communication strategies.

Findings

01

Achieves 22.1% higher user service ratio

02

Achieves 12.5% higher sum-rate

03

Supports practical delay-aware deployment with mean latency 36.24 ms

Abstract

Small changes in high altitude platform (HAP) attitude can cause significant deviations in HAP downlink beam directions, thereby severely degrading HAP downlink communication performance. In this paper, we develop a multimodal large language model (LLM) enabled beamforming framework to achieve robust HAP downlink communications.Specifically, we design a vision-language LLM (VL-LLM) that learns from multivariate flight telemetry to forecast short-term HAP attitudes under platform shaking and support delay-aware proactive beam steering.We design an offline forecast-error calibration procedure to obtain upper bounds on forecast errors and improve the reliability of proactive analog beam steering.Based on the attitude forecasts, we proactively update the analog beamformer and propose a QoS-driven beamforming and admission method with a lightweight feasibility-enforcement step to satisfy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.