Neural Operator based Reinforcement Learning for Control of first-order   PDEs with Spatially-Varying State Delay

Jiaqi Hu; Jie Qi; and Jing Zhang

arXiv:2501.18201·cs.AI·January 31, 2025

Neural Operator based Reinforcement Learning for Control of first-order PDEs with Spatially-Varying State Delay

Jiaqi Hu, Jie Qi, and Jing Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel reinforcement learning framework that combines DeepONet and backstepping control to effectively manage unstable PDEs with spatially-varying delays, outperforming traditional methods.

Contribution

It proposes a new DeepONet-based RL approach that eliminates the need for explicit delay function assumptions in controlling PDEs.

Findings

01

Outperforms baseline SAC without backstepping knowledge

02

Successfully controls unstable first-order hyperbolic PDEs with delays

03

Demonstrates effectiveness of combining analytical control with deep learning

Abstract

Control of distributed parameter systems affected by delays is a challenging task, particularly when the delays depend on spatial variables. The idea of integrating analytical control theory with learning-based control within a unified control scheme is becoming increasingly promising and advantageous. In this paper, we address the problem of controlling an unstable first-order hyperbolic PDE with spatially-varying delays by combining PDE backstepping control strategies and deep reinforcement learning (RL). To eliminate the assumption on the delay function required for the backstepping design, we propose a soft actor-critic (SAC) architecture incorporating a DeepONet to approximate the backstepping controller. The DeepONet extracts features from the backstepping controller and feeds them into the policy network. In simulations, our algorithm outperforms the baseline SAC without prior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jackyhum/NeuralOperatorFeedbackWithSpatiallyVaryingStateDelay
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExtremum Seeking Control Systems

MethodsDilated Convolution · Convolution · 1x1 Convolution · Average Pooling · Global Average Pooling · Switchable Atrous Convolution