Dynamic Plasma Shape Control with Arbitrary Sensor Subsets
D. Sorokin, M. Stokolesov, A. Granovskiy, I. Prokofyev, E. Adishchev, M. Nurgaliev, E. Khayrutdinov, G. Subbotin, R. Clark, D. Orlov

TL;DR
This paper introduces a reinforcement learning-based control system for tokamak plasma shape regulation that is robust to sensor failures and capable of zero-shot transfer from simulation to real devices.
Contribution
It presents a novel RL agent trained in high-fidelity simulation that can handle arbitrary sensor subsets and transfer to physical tokamak experiments without retraining.
Findings
Achieves a mean shape error of 2.01 cm in simulation.
Demonstrates zero-shot transfer to DIII-D tokamak for dynamic shape control.
Robust to 30% magnetic sensor dropout during operation.
Abstract
Plasma shape control in tokamaks requires a real-time controller that tracks dynamically changing shape targets while tolerating diagnostic failures. Classical approaches decompose the problem into equilibrium reconstruction followed by a linear controller, and assume a fixed, fully operational sensor set. We present a reinforcement learning agent that addresses both limitations simultaneously. The agent is trained in NSFsim, a high-fidelity tokamak simulator configured for DIII-D, on a curated dataset of 120 experimental plasma shapes. The shape targets are resampled as random step changes every 0.25 s, exposing the agent to diverse transitions across the full shape envelope. At test time the agent zero-shot tracks dynamic shape sequences; on a held-out static configuration in simulation it achieves a mean shape error of 2.01 cm, and dynamic trajectory following is demonstrated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
