Entropy Regularization under Bayesian Drift Uncertainty

Andy Au

arXiv:2602.16862·math.OC·April 13, 2026

Entropy Regularization under Bayesian Drift Uncertainty

Andy Au

PDF

TL;DR

This paper analyzes entropy-regularized portfolio optimization under Bayesian drift uncertainty, showing Gaussian policies are optimal, with entropy affecting policy variance and providing belief-dependent robustness.

Contribution

It derives closed-form solutions for optimal policies under Bayesian uncertainty, highlighting how entropy regularization influences policy variance and robustness.

Findings

01

Gaussian policies remain optimal under partial information.

02

Entropy regularization affects only policy variance, not mean control.

03

Optimal policy variance increases with posterior conviction, enhancing robustness.

Abstract

We study entropy-regularized mean-variance portfolio optimization under Bayesian drift uncertainty. Gaussian policies remain optimal under partial information, the value function is quadratic in wealth, and belief-dependent coefficients admit closed-form solutions. The mean control is identical to deterministic Bayesian Markowitz feedback; entropy regularization affects only the policy variance. Additionally, this variance does not affect information gain, and instead provides belief-dependent robustness. Notably, optimal policy variance increases with posterior conviction $∣ m_{t} ∣$ , forcing greater action randomization when mean position is most aggressive.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.