Reliable Statistical Guarantees for Conformal Predictors with Small Datasets
Miguel S\'anchez-Dom\'inguez, Lucas Lacasa, Javier de Vicente, Gonzalo Rubio, Eusebio Valero

TL;DR
This paper introduces a new statistical guarantee for conformal predictors that remains reliable with small datasets, improving uncertainty quantification in surrogate models for safety-critical applications.
Contribution
It proposes a probabilistic coverage guarantee for conformal predictors that is effective even with limited calibration data, bridging the gap in existing methods.
Findings
The new guarantee converges to standard CP with large datasets.
It provides meaningful coverage information for small calibration sets.
Validated through multiple examples and implemented in open-source software.
Abstract
Surrogate models (including deep neural networks and other machine learning algorithms in supervised learning) are capable of approximating arbitrarily complex, high-dimensional input-output problems in science and engineering, but require a thorough data-agnostic uncertainty quantification analysis before these can be deployed for any safety-critical application. The standard approach for data-agnostic uncertainty quantification is to use conformal prediction (CP), a well-established framework to build uncertainty models with proven statistical guarantees that do not assume any shape for the error distribution of the surrogate model. However, since the classic statistical guarantee offered by CP is given in terms of bounds for the marginal coverage, for small calibration set sizes (which are frequent in realistic surrogate modelling that aims to quantify error at different regions),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Probabilistic and Robust Engineering Design · Advanced Multi-Objective Optimization Algorithms
