Optimal Multilevel Matching in Clustered Observational Studies: A Case Study of the Effectiveness of Private Schools Under a Large-Scale Voucher System
Luke Keele, Jose R. Zubizarreta

TL;DR
This paper develops an optimal multilevel matching method for clustered observational studies, using dynamic programming to improve covariate balance at both cluster and individual levels, demonstrated through a case study on Chilean voucher schools.
Contribution
It introduces a novel matching strategy that prioritizes individual matching before cluster matching, extending optimal matching to hierarchical data without relying on propensity scores.
Findings
The proposed method effectively balances covariates at multiple levels.
It can approximate a paired clustered randomized design.
Applied to Chilean voucher schools, it provided insights into treatment effects.
Abstract
A distinctive feature of a clustered observational study is its multilevel or nested data structure arising from the assignment of treatment, in a non-random manner, to groups or clusters of units or individuals. Examples are ubiquitous in the health and social sciences including patients in hospitals, employees in firms, and students in schools. What is the optimal matching strategy in a clustered observational study? At first thought, one might start by matching clusters of individuals and then, within matched clusters, continue by matching individuals. But as we discuss in this paper, the optimal strategy is the opposite: in typical applications, where the intracluster correlation is not perfect, it is best to first match individuals and, once all possible combinations of matched individuals are known, then match clusters. In this paper we use dynamic and integer programming to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Healthcare Policy and Management · Urban, Neighborhood, and Segregation Studies
