Focal plane wavefront control with model-based reinforcement learning
Jalo Nousiainen, Iremsu Taskin, Markus Kasper, Gilles Orban De Xivry, Olivier Absil

TL;DR
This paper introduces a model-based reinforcement learning method, PO4NCPA, for automatic correction of static and dynamic non-common-path aberrations in high-contrast imaging, improving exoplanet observation capabilities.
Contribution
It extends reinforcement learning for focal plane control, enabling real-time, model-free correction of aberrations without prior system knowledge.
Findings
PO4NCPA robustly compensates static and dynamic NCPAs in simulations.
Achieves near-optimal PSF suppression and Strehl ratio.
Effective under photon noise, background noise, and for ELT pupils.
Abstract
The direct imaging of potentially habitable exoplanets is one prime science case for high-contrast imaging instruments on extremely large telescopes. Most such exoplanets orbit close to their host stars, where their observation is limited by fast-moving atmospheric speckles and quasi-static non-common-path aberrations (NCPA). Conventional NCPA correction methods often use mechanical mirror probes, which compromise performance during operation. This work presents machine-learning-based NCPA control methods that automatically detect and correct both dynamic and static NCPA errors by leveraging sequential phase diversity. We extend previous work in reinforcement learning for AO to focal plane control. A new model-based RL algorithm, Policy Optimization for NCPAs (PO4NCPA), interprets the focal-plane image as input data and, through sequential phase diversity, determines phase corrections…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
