Better Measurement or Larger Samples? Data Collection for Policy Learning with Unobserved Heterogeneity

Giacomo Opocher

arXiv:2604.07181·econ.EM·May 6, 2026

Better Measurement or Larger Samples? Data Collection for Policy Learning with Unobserved Heterogeneity

Giacomo Opocher

PDF

TL;DR

This paper analyzes how data collection strategies, including sample size and measurement precision, impact policy effectiveness when unobserved heterogeneity affects treatment responses.

Contribution

It introduces a framework for designing optimal data collection plans balancing measurement precision and sample size to improve policy outcomes.

Findings

01

Including a proxy for entrepreneurs' skills increases welfare by 5%.

02

Halves the probability of welfare losses with better proxies.

03

Provides a method to allocate resources between measurement precision and sample size.

Abstract

Empirical research shows that individuals' responses to treatments vary along latent characteristics, such as innate ability or motivation. Therefore, a policymaker seeking to maximize welfare may consider designing policies based on observed characteristics and estimated latent traits. I characterize how the estimates' precision affects the worst-case performance of policies deriving rate-sharp regret bounds for assignment rules that include or exclude them, highlighting new trade-offs with the policy space complexity. I then study how a policymaker can solve such trade-offs by designing tailored data collections and derive a sufficient condition for a collection plan to be minimax optimal. In an empirical application in development economics, I show that including a proxy for entrepreneurs' business skills in targeting cash transfers increases welfare by 5%, and halves the probability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.