Statistical Testing under Distributional Shifts
Nikolaj Thams, Sorawit Saengkyongam, Niklas Pfister, Jonas Peters

TL;DR
This paper develops a statistical testing framework that accounts for distributional shifts between observed data and target distributions, ensuring valid inference even when data is sampled under different conditions.
Contribution
It introduces a resampling-based testing procedure that maintains asymptotic properties under distributional shifts and extends to finite sample and uniform levels, with applications in various fields.
Findings
The proposed test inherits asymptotic level and power under known shifts.
It maintains guarantees when the shift map is estimated from data.
Applications include reinforcement learning, covariate shift, and causal inference.
Abstract
In this work, we introduce statistical testing under distributional shifts. We are interested in the hypothesis for a target distribution , but observe data from a different distribution . We assume that is related to through a known shift and formally introduce hypothesis testing in this setting. We propose a general testing procedure that first resamples from the observed data to construct an auxiliary data set and then applies an existing test in the target domain. We prove that if the size of the resample is at most and the resampling weights are well-behaved, this procedure inherits the pointwise asymptotic level and power from the target test. If the map is estimated from data, we can maintain the above guarantees under mild conditions if the estimation works sufficiently well. We further extend our results to finite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Statistical Methods and Inference · Markov Chains and Monte Carlo Methods
