Error Autocorrelation Objective Function for Improved System Modeling
Anand Ramakrishnan, Warren B.Jackson, Kent Evans

TL;DR
This paper introduces a new whitening cost function based on the Ljung-Box statistic for deep learning models, which reduces error correlations to improve generalization and extrapolation in system modeling tasks.
Contribution
The paper proposes a novel whitening objective function that minimizes error autocorrelations, enhancing model generalization beyond traditional MSE-based training.
Findings
Significant improvement in generalization for RNNs and image autoencoders.
Better extrapolation capabilities demonstrated in simulated and real mechanical systems.
Enhanced spatial and temporal error independence leading to more reliable control systems.
Abstract
Deep learning models are trained to minimize the error between the model's output and the actual values. The typical cost function, the Mean Squared Error (MSE), arises from maximizing the log-likelihood of additive independent, identically distributed Gaussian noise. However, minimizing MSE fails to minimize the residuals' cross-correlations, leading to over-fitting and poor extrapolation of the model outside the training set (generalization). In this paper, we introduce a "whitening" cost function, the Ljung-Box statistic, which not only minimizes the error but also minimizes the correlations between errors, ensuring that the fits enforce compatibility with an independent and identically distributed (i.i.d) gaussian noise model. The results show significant improvement in generalization for recurrent neural networks (RNNs) (1d) and image autoencoders (2d). Specifically, we look at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Advanced Neural Network Applications
