Confidence intervals for the Cox model test error from cross-validation

Min Woo Sun; Robert Tibshirani

arXiv:2201.10770·stat.ME·October 10, 2023

Confidence intervals for the Cox model test error from cross-validation

Min Woo Sun, Robert Tibshirani

PDF

Open Access 1 Repo

TL;DR

This paper investigates the coverage properties of confidence intervals for test error in the Cox proportional hazards model, highlighting issues with standard CV intervals and proposing generalized nested CV methods.

Contribution

It extends the nested CV approach to the Cox model, providing new methods for more accurate confidence intervals for test error in survival analysis.

Findings

01

Standard CV confidence intervals often have below-nominal coverage.

02

Nested CV improves the accuracy of confidence intervals.

03

The paper explores different test error metrics for the Cox model.

Abstract

Cross-validation (CV) is one of the most widely used techniques in statistical learning for estimating the test error of a model, but its behavior is not yet fully understood. It has been shown that standard confidence intervals for test error using estimates from CV may have coverage below nominal levels. This phenomenon occurs because each sample is used in both the training and testing procedures during CV and as a result, the CV estimates of the errors become correlated. Without accounting for this correlation, the estimate of the variance is smaller than it should be. One way to mitigate this issue is by estimating the mean squared error of the prediction error instead using nested CV. This approach has been shown to achieve superior coverage compared to intervals derived from standard CV. In this work, we generalize the nested CV idea to the Cox proportional hazards model and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

minwoosun/nestedcv-confint
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Fault Detection and Control Systems · Advanced Statistical Methods and Models