General Bayesian Policy Learning

Masahiro Kato

arXiv:2602.23672·stat.ML·March 2, 2026

General Bayesian Policy Learning

Masahiro Kato

PDF

Open Access

TL;DR

This paper introduces the General Bayes framework for policy learning, using loss-based Bayesian updating and squared-loss surrogates to optimize decision rules with theoretical guarantees.

Contribution

It develops a novel Bayesian approach for policy learning that incorporates a squared-loss surrogate and provides theoretical PAC-Bayes guarantees.

Findings

01

Maximizes empirical welfare via a scaled squared error approach

02

Provides a Gaussian pseudo-likelihood interpretation of the generalized posterior

03

Demonstrates neural network implementation with theoretical guarantees

Abstract

This study proposes the General Bayes framework for policy learning. We consider decision problems in which a decision-maker chooses an action from an action set to maximize its expected welfare. Typical examples include treatment choice and portfolio selection. In such problems, the statistical target is a decision rule, and the prediction of each outcome $Y (a)$ is not necessarily of primary interest. We formulate this policy learning problem by loss-based Bayesian updating. Our main technical device is a squared-loss surrogate for welfare maximization. We show that maximizing empirical welfare over a policy class is equivalent to minimizing a scaled squared error in the outcome difference, up to a quadratic regularization controlled by a tuning parameter $ζ > 0$ . This rewriting yields a General Bayes posterior over decision rules that admits a Gaussian pseudo-likelihood…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Advanced Bandit Algorithms Research · Explainable Artificial Intelligence (XAI)