Backward Stochastic Control System with Entropy Regularization

Ziyue Chen; Qi Zhang

arXiv:2411.13219·math.OC·November 21, 2024·SIAM J. Control. Optim.

Backward Stochastic Control System with Entropy Regularization

Ziyue Chen, Qi Zhang

PDF

Open Access

TL;DR

This paper develops a theoretical framework for entropy-regularized optimal control in backward stochastic systems, providing conditions for optimality and establishing existence and uniqueness results, with potential applications in finance and algorithms.

Contribution

It introduces a novel approach to backward stochastic control with entropy regularization, including a stochastic maximum principle and existence-uniqueness results.

Findings

01

Established stochastic maximum principle for the control system.

02

Proved sufficient conditions and implicit form of optimal control.

03

Demonstrated existence and uniqueness in linear-quadratic cases.

Abstract

The entropy regularization is inspired by information entropy from machine learning and the ideas of exploration and exploitation in reinforcement learning, which appears in the control problem to design an approximating algorithm for the optimal control. This paper is concerned with the optimal exploratory control for backward stochastic system, generated by the backward stochastic differential equation and with the entropy regularization in its cost functional. We give the theoretical depict of the optimal relaxed control so as to lay the foundation for the application of such a backward stochastic control system to mathematical finance and algorithm implementation. For this, we first establish the stochastic maximum principle by convex variation method. Then we prove sufficient condition for the optimal control and demonstrate the implicit form of optimal control. Finally, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Advanced Control Systems Optimization · Stability and Control of Uncertain Systems