Safe Reinforcement Learning Beyond Baseline Control: A Hierarchical Framework for Space Triangle Tethered Formation System
Xinyi Tao, Panfeng Huang, Fan Zhang

TL;DR
This paper introduces a hierarchical reinforcement learning framework combining model-based control and SAC to improve precision, efficiency, and safety in space tethered formations, validated through simulations.
Contribution
A novel hierarchical RL control framework for TTFS that integrates baseline control with SAC, enhancing deployment accuracy and energy efficiency.
Findings
Reduces steady-state tether tracking errors by over 96%.
Decreases fuel consumption by two orders of magnitude.
Proves closed-loop stability using Lyapunov methods.
Abstract
Triangular tethered formation system (TTFS) provide a promising platform for deep space exploration and distributed sensing due to its intrinsic spatial-orientation stability and capability of adjusting distances among node satellites through deployment and retrieval of tethers. However, due to the coupled tether-satellite dynamics and disturbance sensitivity of TTFS, traditional control methods struggle to achieve a balanced trade-off among configuration accuracy requirements, tension constraints, and energy efficiency consumption throughout the deployment process.In this paper, a novel model-reference reinforcement learning control framework is proposed for TTFS. By integrating baseline model-based control with a Soft Actor-Critic (SAC) compensator, the proposed method simultaneously achieves high-precision tracking, fuel efficiency, and compliance with tension limits. A hierarchical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpace Satellite Systems and Control · Spacecraft Dynamics and Control · Inertial Sensor and Navigation
