Plotting correlated data

Lukas Koch

arXiv:2601.20805·stat.ME·May 5, 2026


TL;DR

This paper discusses why correlated uncertainties are hard to interpret in data plots and proposes visualizing principal components and conditional uncertainties to improve the assessment of model fit.

Contribution

It introduces techniques that enhance data plots by explicitly showing principal components and conditional uncertainties, making correlated errors easier to interpret.

Findings

01 Correlated uncertainties complicate model fit assessment in plots.

02 Adding principal component contributions improves understanding of data-model agreement.

03 Displaying conditional uncertainties helps identify where models may be deficient.
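The two quantities named in the findings can be sketched numerically. The following is a minimal illustration and not the paper's implementation: it assumes Gaussian uncertainties described by a covariance matrix, takes "principal components" to mean the eigenvectors of that matrix, and takes "conditional uncertainty" to mean the standard deviation of one point when all other points are held fixed. The function names and the toy covariance are hypothetical.

```python
import numpy as np


def principal_component_pulls(y, model, cov):
    """Project residuals onto the covariance eigenvectors.

    Each pull is a residual component divided by the standard
    deviation along that eigenvector, so the pulls are uncorrelated
    and their squares sum to the full chi-square.
    """
    eigvals, eigvecs = np.linalg.eigh(cov)  # cov is symmetric
    resid = y - model
    pulls = eigvecs.T @ resid / np.sqrt(eigvals)
    return pulls, eigvals, eigvecs


def conditional_sigma(cov):
    """Standard deviation of each point given all other points.

    For a multivariate Gaussian this is 1 / sqrt(diag(C^-1)).
    Assumption: conditioning on *all* other points.
    """
    precision = np.linalg.inv(cov)
    return 1.0 / np.sqrt(np.diag(precision))


# Toy data (hypothetical): 5 points, unit error bars, correlation 0.9.
n, rho = 5, 0.9
cov = rho * np.ones((n, n)) + (1.0 - rho) * np.eye(n)
y = np.zeros(n)
model = np.array([0.5, -0.5, 0.5, -0.5, 0.5])

pulls, eigvals, eigvecs = principal_component_pulls(y, model, cov)
chi2 = float(np.sum(pulls**2))
cond = conditional_sigma(cov)
```

In this toy setup the conditional uncertainties come out at roughly 0.35, much smaller than the unit error bars that a plot of the diagonal alone would show, which is why a model line can sit inside every error bar and still be a poor fit.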

Abstract

A very common task in data visualization is to plot many data points with some measured y-value as a function of fixed x-values. Uncertainties on the y-values are typically presented as vertical error bars that represent either a Frequentist confidence interval or Bayesian credible interval for each data point. Most of the time, these error bars represent a 68% confidence/credibility level, which leads to the intuition that a model fits the data reasonably well if its prediction lies within the error bars of roughly two thirds of the data points. Unfortunately, this and other intuitions no longer work when the uncertainties of the data points are correlated. If the error bars only show the square root of diagonal elements of some covariance matrix with non-negligible off-diagonal elements, we simply do not have enough information in the plot to judge whether a drawn model line agrees…
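A small numerical sketch can make the failure of the "two thirds inside the error bars" intuition concrete. It is an illustration under stated assumptions, not taken from the paper: the covariance is a hypothetical equicorrelated matrix with unit variances, and the two residual patterns and helper names are invented for the example.

```python
import numpy as np


def chi2_diagonal(resid, cov):
    """Chi-square using only the error bars (diagonal of cov)."""
    return float(np.sum(resid**2 / np.diag(cov)))


def chi2_full(resid, cov):
    """Chi-square using the full covariance matrix."""
    return float(resid @ np.linalg.solve(cov, resid))


# Hypothetical setup: 10 points, unit error bars, correlation 0.9.
n, rho = 10, 0.9
cov = rho * np.ones((n, n)) + (1.0 - rho) * np.eye(n)

# Model A: every point is off by one full error bar, same direction.
resid_shift = np.ones(n)
# Model B: every point is off by half an error bar, alternating sign.
resid_alt = 0.5 * np.array([(-1.0) ** i for i in range(n)])

shift_diag = chi2_diagonal(resid_shift, cov)
shift_full = chi2_full(resid_shift, cov)
alt_diag = chi2_diagonal(resid_alt, cov)
alt_full = chi2_full(resid_alt, cov)
```

Judged by the error bars alone, model A looks marginal (diagonal chi-square of 10) and model B looks excellent (2.5). With the full covariance the verdict flips: the common shift is what strongly correlated uncertainties allow (about 1.1), while the alternating pattern is strongly disfavoured (25 for 10 points).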
