Exploratory Utility Maximization Problem with Tsallis Entropy
Chen Ziyi, Gu Jia-wen

TL;DR
This paper explores utility maximization with Tsallis entropy regularization in a complete market, revealing conditions for well-posedness, characterizing optimal strategies, and demonstrating reinforcement learning advantages through numerical experiments.
Contribution
It introduces Tsallis entropy into utility maximization, analyzes well-posedness issues, and provides semi-closed-form solutions for specific cases, extending classical models.
Findings
Optimal strategies include Gaussian and Wigner semicircle distributions.
Well-posedness depends on the primary temperature function.
Reinforcement learning effectively approximates optimal policies.
Abstract
We study expected utility maximization problem with constant relative risk aversion utility function in a complete market under the reinforcement learning framework. To induce exploration, we introduce the Tsallis entropy regularizer, which generalizes the commonly used Shannon entropy. Unlike the classical Merton's problem, which is always well-posed and admits closed-form solutions, we find that the utility maximization exploratory problem is ill-posed in certain cases, due to over-exploration. With a carefully selected primary temperature function, we investigate two specific examples, for which we fully characterize their well-posedness and provide semi-closed-form solutions. It is interesting to find that one example has the well-known Gaussian distribution as the optimal strategy, while the other features the rare Wigner semicircle distribution, which is equivalent to a scaled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Forecasting Techniques and Applications · Statistical Distribution Estimation and Applications
