Uncertainty Quantification of Data-Driven Output Predictors in the Output Error Setting
Farzan Kaviani, Ivan Markovsky, and Hamid R. Ossareh

TL;DR
This paper develops theoretical bounds for the prediction error of data-driven output predictors in noisy settings, showing how noise affects accuracy and comparing direct data use versus low-rank approximation.
Contribution
It introduces two upper bounds on prediction error in noisy data scenarios, applicable without knowing the true system output, and compares the effectiveness of de-noising heuristics.
Findings
Prediction bounds decrease linearly with noise level.
Using raw data can be as effective as de-noising heuristics.
Bounds do not require ground truth output, only noisy measurements and system order.
Abstract
We revisit the problem of predicting the output of an LTI system directly using offline input-output data (and without the use of a parametric model) in the behavioral setting. Existing works calculate the output predictions by projecting the recent samples of the input and output signals onto the column span of a Hankel matrix consisting of the offline input-output data. However, if the offline data is corrupted by noise, the output prediction is no longer exact. While some prior works propose mitigating noisy data through matrix low-ranking approximation heuristics, such as truncated singular value decomposition, the ensuing prediction accuracy remains unquantified. This paper fills these gaps by introducing two upper bounds on the prediction error under the condition that the noise is sufficiently small relative to the offline data's magnitude. The first bound pertains to prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems
