Mitigating Data Scarcity in Spaceflight Applications for Offline Reinforcement Learning Using Physics-Informed Deep Generative Models
Alex E. Ballentine, Nachiket U. Bapat, and Raghvendra V. Cowlagi

TL;DR
This paper introduces a physics-informed generative model called MI-VAE to generate synthetic data, improving offline reinforcement learning in spaceflight applications with scarce real-world data.
Contribution
The paper presents MI-VAE, a novel physics-informed VAE that leverages physics-based models to generate realistic synthetic data for spaceflight RL tasks.
Findings
MI-VAE outperforms standard VAEs in statistical fidelity and diversity.
Augmenting datasets with MI-VAE improves RL policy success rates.
MI-VAE effectively incorporates physics constraints into data generation.
Abstract
The deployment of reinforcement learning (RL)-based controllers on physical systems is often limited by poor generalization to real-world scenarios, known as the simulation-to-reality (sim-to-real) gap. This gap is particularly challenging in spaceflight, where real-world training data are scarce due to high cost and limited planetary exploration data. Traditional approaches, such as system identification and synthetic data generation, depend on sufficient data and often fail due to modeling assumptions or lack of physics-based constraints. We propose addressing this data scarcity by introducing physics-based learning bias in a generative model. Specifically, we develop the Mutual Information-based Split Variational Autoencoder (MI-VAE), a physics-informed VAE that learns differences between observed system trajectories and those predicted by physics-based models. The latent space of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
