Neural Network Training Using Closed-Loop Data: Hazards and an   Instrumental Variable (IVNN) Solution

Johan Kon; Marcel Heertjes; Tom Oomen

arXiv:2202.05337·eess.SY·March 31, 2023

Neural Network Training Using Closed-Loop Data: Hazards and an Instrumental Variable (IVNN) Solution

Johan Kon, Marcel Heertjes, Tom Oomen

PDF

Open Access

TL;DR

This paper identifies the problem of parameter inconsistency in neural network control systems trained with closed-loop data and proposes an instrumental variable method (IVNN) to achieve consistent estimates and optimal control performance.

Contribution

It introduces the IVNN approach that uses instrumental variables to ensure consistent neural network parameter estimation in closed-loop control systems.

Findings

01

IVNN asymptotically recovers the optimal solution

02

Pre-existing methods lead to inconsistent estimates

03

Demonstrated effectiveness on a nonlinear system example

Abstract

An increasing trend in the use of neural networks in control systems is being observed. The aim of this paper is to reveal that the straightforward application of learning neural network feedforward controllers with closed-loop data may introduce parameter inconsistency that degrades control performance, and to provide a solution. The proposed method employs instrumental variables to ensure consistent parameter estimates. A nonlinear system example reveals that the developed instrumental variable neural network (IVNN) approach asymptotically recovers the optimal solution, while pre-existing approaches are shown to lead to inconsistent estimates.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Control Systems and Identification · Neural Networks and Applications