Plasma Shape Control via Zero-shot Generative Reinforcement Learning
Niannian Wu, Rongpeng Li, Zongyu Yang, Yong Xiao, Ning Wei, Yihang Chen, Bo Li, Zhifeng Zhao, and Wulyu Zhong

TL;DR
This paper introduces a zero-shot reinforcement learning framework that leverages offline data and generative imitation learning to achieve adaptable plasma shape control without task-specific retraining.
Contribution
It combines GAIL with Hilbert space representation learning to create a versatile, goal-oriented control policy from offline data, enabling zero-shot deployment for plasma shape control.
Findings
Policy accurately tracks plasma shape trajectories in simulations.
Method outperforms traditional controllers in stability and precision.
Framework demonstrates potential for future fusion reactor control systems.
Abstract
Traditional PID controllers have limited adaptability for plasma shape control, and task-specific reinforcement learning (RL) methods suffer from limited generalization and the need for repetitive retraining. To overcome these challenges, this paper proposes a novel framework for developing a versatile, zero-shot control policy from a large-scale offline dataset of historical PID-controlled discharges. Our approach synergistically combines Generative Adversarial Imitation Learning (GAIL) with Hilbert space representation learning to achieve dual objectives: mimicking the stable operational style of the PID data and constructing a geometrically structured latent space for efficient, goal-directed control. The resulting foundation policy can be deployed for diverse trajectory tracking tasks in a zero-shot manner without any task-specific fine-tuning. Evaluations on the HL-3 tokamak…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMagnetic confinement fusion research · Adaptive Dynamic Programming Control · Plasma and Flow Control in Aerodynamics
