Learning Counterfactual Distributions via Kernel Nearest Neighbors
Kyuseong Choi, Jacob Feitelberg, Caleb Chin, Anish Agarwal, Raaz Dwivedi

TL;DR
This paper introduces a kernel-based nearest neighbors method for estimating multivariate distributions across units and outcomes, effectively handling missing not at random data and unobserved confounding in distributional matrix completion.
Contribution
It proposes a novel distributional matrix completion framework using kernel mean embeddings and maximum mean discrepancies, ensuring consistent recovery under challenging data conditions.
Findings
Consistent distribution recovery despite missing not at random data.
Robustness to heteroscedastic noise with multiple measurements.
Effective estimation in the presence of unobserved confounding.
Abstract
Consider a setting with multiple units (e.g., individuals, cohorts, geographic locations) and outcomes (e.g., treatments, times, items), where the goal is to learn a multivariate distribution for each unit-outcome entry, such as the distribution of a user's weekly spend and engagement under a specific mobile app version. A common challenge is the prevalence of missing not at random data, where observations are available only for certain unit-outcome combinations and the observation availability can be correlated with the properties of distributions themselves, i.e., there is unobserved confounding. An additional challenge is that for any observed unit-outcome entry, we only have a finite number of samples from the underlying distribution. We tackle these two challenges by casting the problem into a novel distributional matrix completion framework and introduce a kernel based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Face and Expression Recognition
MethodsDistributional Generalization
