TLE-Based A2C Agent for Terrestrial Coverage Orbital Path Planning
Anantha Narayanan, Battu Bhanu Teja, Pruthwik Mishra

TL;DR
This paper introduces a reinforcement learning approach using A2C for optimizing satellite orbits to improve terrestrial coverage, demonstrating faster convergence and better performance than PPO in a physics-based simulation environment.
Contribution
The work develops a TLE-based orbital simulation environment and validates the effectiveness of A2C over PPO for continuous orbital control tasks.
Findings
A2C outperforms PPO in reward and convergence speed.
The method achieves rapid adaptation for satellite deployment.
The environment accurately models orbital physics constraints.
Abstract
The increasing congestion of Low Earth Orbit (LEO) poses persistent challenges to the efficient deployment and safe operation of Earth observation satellites. Mission planners must now account not only for mission-specific requirements but also for the increasing collision risk with active satellites and space debris. This work presents a reinforcement learning framework using the Advantage Actor-Critic (A2C) algorithm to optimize satellite orbital parameters for precise terrestrial coverage within predefined surface radii. By formulating the problem as a Markov Decision Process (MDP) within a custom OpenAI Gymnasium environment, our method simulates orbital dynamics using classical Keplerian elements. The agent progressively learns to adjust five of the orbital parameters - semi-major axis, eccentricity, inclination, right ascension of ascending node, and the argument of perigee-to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
