Satellites swarm cooperation for pursuit-attachment tasks with transformer-based reinforcement learning
yonghao Li

TL;DR
This paper introduces a transformer-based reinforcement learning framework for satellite swarm cooperation in pursuit-attachment tasks, enhancing scalability, robustness, and efficiency under limited information conditions.
Contribution
It develops a novel multi-agent reinforcement learning approach integrating transformers and expert networks for improved satellite swarm coordination.
Findings
The proposed method outperforms existing algorithms in convergence speed.
The approach demonstrates high success rates in pursuit-attachment tasks.
Simulations confirm robustness across different maneuvering strategies.
Abstract
The on-orbit intelligent planning of satellites swarm has attracted increasing attention from scholars. Especially in tasks such as the pursuit and attachment of non-cooperative satellites, satellites swarm must achieve coordinated cooperation with limited resources. The study proposes a reinforcement learning framework that integrates the transformer and expert networks. Firstly, under the constraints of incomplete information about non-cooperative satellites, an implicit multi-satellites cooperation strategy was designed using a communication sharing mechanism. Subsequently, for the characteristics of the pursuit-attachment tasks, the multi-agent reinforcement learning framework is improved by introducing transformers and expert networks inspired by transfer learning ideas. To address the issue of satellites swarm scalability, sequence modelling based on transformers is utilized to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Spacecraft Dynamics and Control · Opportunistic and Delay-Tolerant Networks
