Proxy-Guided Measurement Calibration
Saketh Vishnubhatla, Shu Wan, Andre Harrison, Adrienne Raglin, Huan Liu

TL;DR
This paper introduces a proxy-guided framework using causal modeling and variational autoencoders to identify and correct systematic measurement errors in aggregate outcome data, improving accuracy in fields like disaster loss estimation.
Contribution
It proposes a novel two-stage method leveraging proxy variables and variational autoencoders to disentangle true outcomes from systematic biases in measurement data.
Findings
Effective bias correction demonstrated on synthetic data
Improved outcome estimation in semi-synthetic datasets
Case study shows practical utility in disaster loss reporting
Abstract
Aggregate outcome variables collected through surveys and administrative records are often subject to systematic measurement error. For instance, in disaster loss databases, county-level losses reported may differ from the true damages due to variations in on-the-ground data collection capacity, reporting practices, and event characteristics. Such miscalibration complicates downstream analysis and decision-making. We study the problem of outcome miscalibration and propose a framework guided by proxy variables for estimating and correcting the systematic errors. We model the data-generating process using a causal graph that separates latent content variables driving the true outcome from the latent bias variables that induce systematic errors. The key insight is that proxy variables that depend on the true outcome but are independent of the bias mechanism provide identifying information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Data-Driven Disease Surveillance · Survey Methodology and Nonresponse
