Model-free and Bayesian Ensembling Model-based Deep Reinforcement   Learning for Particle Accelerator Control Demonstrated on the FERMI FEL

Simon Hirlaender; Niky Bruchon

arXiv:2012.09737·cs.LG·January 27, 2022·5 cites

Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL

Simon Hirlaender, Niky Bruchon

PDF

Open Access 1 Repo

TL;DR

This paper compares model-free and model-based deep reinforcement learning methods for particle accelerator control, demonstrating that model-based approaches offer higher sample efficiency and better representational power, with practical implementation on the FERMI FEL system.

Contribution

It introduces a DYNA-style model-based reinforcement learning algorithm with uncertainty awareness for accelerator control, and compares it to a deep Q-learning approach, highlighting their respective advantages.

Findings

01

Model-based RL shows higher sample efficiency.

02

Model-free RL achieves slightly better asymptotic performance.

03

Both methods exhibit increased noise robustness in accelerator control.

Abstract

Reinforcement learning holds tremendous promise in accelerator controls. The primary goal of this paper is to show how this approach can be utilised on an operational level on accelerator physics problems. Despite the success of model-free reinforcement learning in several domains, sample-efficiency still is a bottle-neck, which might be encompassed by model-based methods. We compare well-suited purely model-based to model-free reinforcement learning applied to the intensity optimisation on the FERMI FEL system. We find that the model-based approach demonstrates higher representational power and sample-efficiency, while the asymptotic performance of the model-free method is slightly superior. The model-based algorithm is implemented in a DYNA-style using an uncertainty aware model, and the model-free algorithm is based on tailored deep Q-learning. In both cases, the algorithms were…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MathPhysSim/FERMI_RL_Paper
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReservoir Engineering and Simulation Methods · Scientific Computing and Data Management · Model Reduction and Neural Networks

MethodsAttentive Walk-Aggregating Graph Neural Network