Error Autocorrelation Objective Function for Improved System Modeling

Anand Ramakrishnan; Warren B.Jackson; Kent Evans

arXiv:2008.03582·cs.LG·May 12, 2021

Error Autocorrelation Objective Function for Improved System Modeling

Anand Ramakrishnan, Warren B.Jackson, Kent Evans

PDF

Open Access

TL;DR

This paper introduces a new whitening cost function based on the Ljung-Box statistic for deep learning models, which reduces error correlations to improve generalization and extrapolation in system modeling tasks.

Contribution

The paper proposes a novel whitening objective function that minimizes error autocorrelations, enhancing model generalization beyond traditional MSE-based training.

Findings

01

Significant improvement in generalization for RNNs and image autoencoders.

02

Better extrapolation capabilities demonstrated in simulated and real mechanical systems.

03

Enhanced spatial and temporal error independence leading to more reliable control systems.

Abstract

Deep learning models are trained to minimize the error between the model's output and the actual values. The typical cost function, the Mean Squared Error (MSE), arises from maximizing the log-likelihood of additive independent, identically distributed Gaussian noise. However, minimizing MSE fails to minimize the residuals' cross-correlations, leading to over-fitting and poor extrapolation of the model outside the training set (generalization). In this paper, we introduce a "whitening" cost function, the Ljung-Box statistic, which not only minimizes the error but also minimizes the correlations between errors, ensuring that the fits enforce compatibility with an independent and identically distributed (i.i.d) gaussian noise model. The results show significant improvement in generalization for recurrent neural networks (RNNs) (1d) and image autoencoders (2d). Specifically, we look at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Advanced Neural Network Applications