On the implied weights of linear regression for causal inference
Ambarish Chattopadhyay, Jose R. Zubizarreta

TL;DR
This paper analyzes the implied weights of linear regression in causal inference, deriving new formulas, exploring their properties, and proposing diagnostics to improve observational study design.
Contribution
It introduces closed-form expressions for implied weights, studies their properties, and develops diagnostics for causal inference in observational studies.
Findings
Implied weights can be obtained via convex optimization.
Regression weights exhibit doubly and multiply robust properties.
Diagnostics improve the design stage of observational studies.
Abstract
A basic principle in the design of observational studies is to approximate the randomized experiment that would have been conducted under controlled circumstances. Now, linear regression models are commonly used to analyze observational data and estimate causal effects. How do linear regression adjustments in observational studies emulate key features of randomized experiments, such as covariate balance, self-weighted sampling, and study representativeness? In this paper, we provide answers to this and related questions by analyzing the implied (individual-level data) weights of linear regression methods. We derive new closed-form expressions of the weights and examine their properties in both finite and asymptotic regimes. We show that the implied weights of general regression problems can be equivalently obtained by solving a convex optimization problem. Among others, we study doubly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
