A safe Hosmer-Lemeshow test
Alexander Henzi, Marius Puke, Timo Dimitriadis, Johanna Ziegel

TL;DR
This paper introduces a new calibration test for binary event predictions using e-values and online isotonic regression, offering improved stability and power over the traditional Hosmer-Lemeshow test, demonstrated through simulations and real data.
Contribution
It proposes an e-value based calibration test that is theoretically superior and more stable than the Hosmer-Lemeshow test, addressing its known limitations.
Findings
The eHL test has power against all alternatives.
It detects slight miscalibrations in practical sample sizes.
It provides clearer results on credit default predictions during a crisis.
Abstract
This article proposes an alternative to the Hosmer-Lemeshow (HL) test for evaluating the calibration of probability forecasts for binary events. The approach is based on e-values, a new tool for hypothesis testing. An e-value is a random variable with expected value less or equal to one under a null hypothesis. Large e-values give evidence against the null hypothesis, and the multiplicative inverse of an e-value is a p-value. Our test uses online isotonic regression to estimate the calibration curve as a `betting strategy' against the null hypothesis. We show that the test has power against essentially all alternatives, which makes it theoretically superior to the HL test and at the same time resolves the well-known instability problem of the latter. A simulation study shows that a feasible version of the proposed eHL test can detect slight miscalibrations in practically relevant sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMonetary Policy and Economic Impact · Financial Risk and Volatility Modeling · Financial Markets and Investment Strategies
