A Leakage Bound for Confidence Sets after Black-Box Selection

Sayantan Banerjee

arXiv:2604.26706·math.ST·April 30, 2026

A Leakage Bound for Confidence Sets after Black-Box Selection

Sayantan Banerjee

PDF

TL;DR

This paper establishes a bound on the noncoverage probability of confidence sets after black-box selection, linking it to the information the selection process reveals about the data.

Contribution

It introduces a novel leakage bound for confidence sets post black-box selection, quantifying the inferential cost in terms of information measures.

Findings

01

Bound on noncoverage involving total variation distance

02

Mutual-information bound for black-box selection

03

Guarantees for noisy screening with Gaussian bounds

Abstract

In many analyses the object reported at the end is not fixed in advance, but is chosen after a preliminary search over variables, subgroups, transformations, models or contrasts. Classical selective-inference methods are most effective when this search can be written as an explicit selection event. This note treats the less structured case in which the selection rule is a black box and inference is required for the target indexed by the selected object. We show that, for any fixed-target confidence procedure, selected-target noncoverage is bounded by the nominal fixed-target noncoverage plus the average total variation distance between the marginal law of the inferential data and its conditional law given the selected object. A mutual-information bound follows immediately. The result recovers sample splitting as the zero-leakage case and gives explicit guarantees for noisy screening…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.