Model Selection for Treatment Choice: Penalized Welfare Maximization
Eric Mbakop, Max Tabord-Meehan

TL;DR
This paper introduces a penalized decision rule called PWM for treatment assignment, enabling optimal model selection over various covariate subsets and providing theoretical guarantees for regret bounds in policy choice.
Contribution
It extends statistical learning methods to treatment choice, developing the PWM rule with oracle inequalities for model selection and regret bounds.
Findings
PWM performs effective model selection among covariate subsets.
Theoretical regret bounds are established for PWM.
Hold-out procedure formalizes data-driven model selection.
Abstract
This paper studies a penalized statistical decision rule for the treatment assignment problem. Consider the setting of a utilitarian policy maker who must use sample data to allocate a binary treatment to members of a population, based on their observable characteristics. We model this problem as a statistical decision problem where the policy maker must choose a subset of the covariate space to assign to treatment, out of a class of potential subsets. We focus on settings in which the policy maker may want to select amongst a collection of constrained subset classes: examples include choosing the number of covariates over which to perform best-subset selection, and model selection when approximating a complicated class via a sieve. We adapt and extend results from statistical learning to develop the Penalized Welfare Maximization (PWM) rule. We establish an oracle inequality for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
