Test-time augmentation improves efficiency in conformal prediction

Divya Shanmugam; Helen Lu; Swami Sankaranarayanan; John Guttag

arXiv:2505.22764·cs.LG·May 30, 2025

Test-time augmentation improves efficiency in conformal prediction

Divya Shanmugam, Helen Lu, Swami Sankaranarayanan, John Guttag

PDF

Open Access

TL;DR

This paper demonstrates that test-time augmentation significantly reduces the size of prediction sets in conformal classifiers, improving efficiency without retraining across various datasets and models.

Contribution

It introduces a flexible, efficient TTA method that reduces conformal prediction set sizes by 10-14% without retraining, applicable to any conformal score.

Findings

01

Test-time augmentation reduces conformal set sizes by 10-14%.

02

The approach is effective across multiple datasets, models, and conformal scoring methods.

03

TTA improves efficiency under different distribution shifts.

Abstract

A conformal classifier produces a set of predicted classes and provides a probabilistic guarantee that the set includes the true class. Unfortunately, it is often the case that conformal classifiers produce uninformatively large sets. In this work, we show that test-time augmentation (TTA)--a technique that introduces inductive biases during inference--reduces the size of the sets produced by conformal classifiers. Our approach is flexible, computationally efficient, and effective. It can be combined with any conformal score, requires no model retraining, and reduces prediction set sizes by 10%-14% on average. We conduct an evaluation of the approach spanning three datasets, three models, two established conformal scoring methods, different guarantee strengths, and several distribution shifts to show when and why test-time augmentation is a useful addition to the conformal pipeline.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Imbalanced Data Classification Techniques · Face and Expression Recognition

MethodsSparse Evolutionary Training