Approximate Conditional Coverage & Calibration via Neural Model   Approximations

Allen Schmaltz; Danielle Rasooly

arXiv:2205.14310·cs.LG·December 2, 2022

Approximate Conditional Coverage & Calibration via Neural Model Approximations

Allen Schmaltz, Danielle Rasooly

PDF

Open Access

TL;DR

This paper introduces a novel calibration method for deep classifiers using KNN-based approximations and Mondrian Conformal Predictors, achieving well-calibrated, robust prediction sets in NLP tasks with distribution shifts.

Contribution

It develops a data-driven partitioning approach with a new Inductive Venn Predictor for improved calibration and robustness in deep model uncertainty quantification.

Findings

01

Achieves well-calibrated prediction sets in NLP classification tasks.

02

Demonstrates robustness to data partition changes and covariate shifts.

03

Outperforms recent conformal prediction methods on challenging datasets.

Abstract

A typical desideratum for quantifying the uncertainty from a classification model as a prediction set is class-conditional singleton set calibration. That is, such sets should map to the output of well-calibrated selective classifiers, matching the observed frequencies of similar instances. Recent works proposing adaptive and localized conformal p-values for deep networks do not guarantee this behavior, nor do they achieve it empirically. Instead, we use the strong signals for prediction reliability from KNN-based approximations of Transformer networks to construct data-driven partitions for Mondrian Conformal Predictors, which are treated as weak selective classifiers that are then calibrated via a new Inductive Venn Predictor, the Venn-ADMIT Predictor. The resulting selective classifiers are well-calibrated, in a conservative but practically useful sense for a given threshold. They…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Natural Language Processing Techniques · Imbalanced Data Classification Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Label Smoothing · Adam · Dense Connections · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dropout · Layer Normalization