A novel Deep Neural Network architecture for non-linear system   identification

Luca Zancato; Alessandro Chiuso

arXiv:2106.03078·cs.LG·June 8, 2021

A novel Deep Neural Network architecture for non-linear system identification

Luca Zancato, Alessandro Chiuso

PDF

TL;DR

This paper introduces a new deep neural network architecture designed for non-linear system identification, emphasizing generalization, automatic complexity selection, and suitability for large datasets.

Contribution

The paper proposes a novel DNN architecture with inductive bias and regularization, reducing hyper-parameter tuning and improving scalability for non-linear system identification.

Findings

01

Effective on large-scale datasets

02

Reduces hyper-parameter tuning

03

Enhances generalization in system identification

Abstract

We present a novel Deep Neural Network (DNN) architecture for non-linear system identification. We foster generalization by constraining DNN representational power. To do so, inspired by fading memory systems, we introduce inductive bias (on the architecture) and regularization (on the loss function). This architecture allows for automatic complexity selection based solely on available data, in this way the number of hyper-parameters that must be chosen by the user is reduced. Exploiting the highly parallelizable DNN framework (based on Stochastic optimization methods) we successfully apply our method to large scale datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.