On the number of bins in a rank histogram

Claudio Heinrich

arXiv:2005.09018·stat.AP·September 30, 2022

On the number of bins in a rank histogram

Claudio Heinrich

PDF

TL;DR

This paper investigates how the choice of bin number affects the interpretation of rank histograms in meteorology, proposing a method to select an optimal bin count that balances visual and statistical assessments.

Contribution

It introduces a new method for choosing the number of bins in rank histograms, improving the detection of calibration issues in forecast systems.

Findings

01

Choosing fewer bins than ensemble size plus one can improve calibration detection.

02

The proposed method aligns visual interpretation with formal statistical testing.

03

Optimal bin number depends on the amount of verification data available.

Abstract

Rank histograms are popular tools for assessing the reliability of meteorological ensemble forecast systems. A reliable forecast system leads to a uniform rank histogram, and deviations from uniformity can indicate miscalibrations. However, the ability to identify such deviations by visual inspection of rank histogram plots crucially depends on the number of bins chosen for the histogram. If too few bins are chosen, the rank histogram is likely to miss miscalibrations; if too many are chosen, even perfectly calibrated forecast systems can yield rank histograms that do not appear uniform. In this paper we address this trade-off and propose a method for choosing the number of bins for a rank histogram. The goal of our method is to select a number of bins such that the intuitive decision whether a histogram is uniform or not is as close as possible to a formal statistical test. Our results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.