
TL;DR
This paper introduces the General Bayes framework for policy learning, using loss-based Bayesian updating and squared-loss surrogates to optimize decision rules with theoretical guarantees.
Contribution
It develops a novel Bayesian approach for policy learning that incorporates a squared-loss surrogate and provides theoretical PAC-Bayes guarantees.
Findings
Maximizes empirical welfare via a scaled squared error approach
Provides a Gaussian pseudo-likelihood interpretation of the generalized posterior
Demonstrates neural network implementation with theoretical guarantees
Abstract
This study proposes the General Bayes framework for policy learning. We consider decision problems in which a decision-maker chooses an action from an action set to maximize its expected welfare. Typical examples include treatment choice and portfolio selection. In such problems, the statistical target is a decision rule, and the prediction of each outcome is not necessarily of primary interest. We formulate this policy learning problem by loss-based Bayesian updating. Our main technical device is a squared-loss surrogate for welfare maximization. We show that maximizing empirical welfare over a policy class is equivalent to minimizing a scaled squared error in the outcome difference, up to a quadratic regularization controlled by a tuning parameter . This rewriting yields a General Bayes posterior over decision rules that admits a Gaussian pseudo-likelihood…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Advanced Bandit Algorithms Research · Explainable Artificial Intelligence (XAI)
