Multi-Agent Reinforcement Learning Counteracts Delayed CSI in Multi-Satellite Systems

Marios Aristodemou; Yasaman Omid; Sangarapillai Lambotharan; Mahsa Derakhshan; Lajos Hanzo

arXiv:2603.16470·cs.IT·March 18, 2026

Multi-Agent Reinforcement Learning Counteracts Delayed CSI in Multi-Satellite Systems

Marios Aristodemou, Yasaman Omid, Sangarapillai Lambotharan, Mahsa Derakhshan, Lajos Hanzo

PDF

Open Access

TL;DR

This paper introduces a multi-agent reinforcement learning algorithm called DS-PPO to improve downlink transmission in multi-satellite systems, effectively handling outdated channel state information and enhancing sum-rate performance.

Contribution

The paper proposes a novel bi-level optimisation algorithm, DS-PPO, for multi-satellite systems to cope with delayed CSI and optimize sum-rate in a distributed multi-antenna setup.

Findings

01

DS-PPO demonstrates robustness to CSI imperfections.

02

Significant sum-rate improvements with DS-PPO.

03

Convergence and computational complexity are analyzed.

Abstract

The integration of satellite communication networks with next-generation (NG) technologies is a promising approach towards global connectivity. However, the quality of services is highly dependant on the availability of accurate channel state information (CSI). Channel estimation in satellite communications is challenging due to the high propagation delay between terrestrial users and satellites, which results in outdated CSI observations on the satellite side. In this paper, we study the downlink transmission of multiple satellites acting as distributed base stations (BS) to mobile terrestrial users. We propose a multi-agent reinforcement learning (MARL) algorithm which aims for maximising the sum-rate of the users, while coping with the outdated CSI. We design a novel bi-level optimisation, procedure themes as dual stage proximal policy optimisation (DS-PPO), for tackling the problem…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSatellite Communication Systems · Advanced MIMO Systems Optimization · Age of Information Optimization