Statistical matching and subclassification with a continuous dose: characterization, algorithm, and application to a health outcomes study
Bo Zhang, Emily J. Mackay, Mike Baiocchi

TL;DR
This paper develops optimal subclassification methods for continuous exposure doses in observational studies, improving causal inference accuracy and applying it to assess TEE monitoring effects on post-surgery mortality.
Contribution
It introduces two criteria and an efficient algorithm for subclassification with continuous doses, enhancing covariate adjustment in dose-response studies.
Findings
Proposed subclassification scheme reduces model dependence.
Algorithm guarantees optimal or near-optimal subclassification.
Application shows TEE monitoring lowers 30-day mortality.
Abstract
Subclassification and matching are often used in empirical studies to adjust for observed covariates; however, they are largely restricted to relatively simple study designs with a binary treatment and less developed for designs with a continuous exposure. Matching with exposure doses is particularly useful in instrumental variable designs and in understanding the dose-response relationships. In this article, we propose two criteria for optimal subclassification based on subclass homogeneity in the context of having a continuous exposure dose, and propose an efficient polynomial-time algorithm that is guaranteed to find an optimal subclassification with respect to one criterion and serves as a 2-approximation algorithm for the other criterion. We discuss how to incorporate dose and use appropriate penalties to control the number of subclasses in the design. Via extensive simulations, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Causal Inference Techniques · Statistical Methods and Bayesian Inference
