Parameterized Reinforcement Learning for Optical System Optimization

Heribert Wankerl; Maike L. Stern; Ali Mahdavi; Christoph; Eichler; Elmar W. Lang

arXiv:2010.05769·cs.LG·November 17, 2021

Parameterized Reinforcement Learning for Optical System Optimization

Heribert Wankerl, Maike L. Stern, Ali Mahdavi, Christoph, Eichler, Elmar W. Lang

PDF

TL;DR

This paper introduces a parameterized reinforcement learning approach for multi-parameter optical system design, effectively optimizing discrete and continuous parameters, outperforming human experts and naive algorithms, and enabling interpretability of optical properties.

Contribution

It presents a novel RL-based method that incorporates discrete and continuous design parameters for optical systems, improving optimization and interpretability.

Findings

01

Outperforms human experts in optical design optimization.

02

Outperforms naive RL algorithms in achieving optical characteristics.

03

Provides interpretable Q-values related to optical properties.

Abstract

Designing a multi-layer optical system with designated optical characteristics is an inverse design problem in which the resulting design is determined by several discrete and continuous parameters. In particular, we consider three design parameters to describe a multi-layer stack: Each layer's dielectric material and thickness as well as the total number of layers. Such a combination of both, discrete and continuous parameters is a challenging optimization problem that often requires a computationally expensive search for an optimal system design. Hence, most methods merely determine the optimal thicknesses of the system's layers. To incorporate layer material and the total number of layers as well, we propose a method that considers the stacking of consecutive layers as parameterized actions in a Markov decision process. We propose an exponentially transformed reward signal that eases…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsQ-Learning