Nonparametric Adaptive Bayesian Stochastic Control Under Model Uncertainty
Tao Chen, Jiyoun Myung

TL;DR
This paper introduces a nonparametric Bayesian approach using Dirichlet processes for stochastic control under model uncertainty, enabling online learning and avoiding model misspecification, with applications demonstrated in finance.
Contribution
It develops a novel nonparametric Bayesian methodology with numerical algorithms and Gaussian process surrogates for stochastic control under uncertainty, improving computational efficiency and robustness.
Findings
Outperforms parametric methods in financial applications
Enables online learning of unknown distributions
Reduces computational costs with surrogate models
Abstract
In this paper we propose a new methodology for solving a discrete time stochastic Markovian control problem under model uncertainty. By utilizing the Dirichlet process, we model the unknown distribution of the underlying stochastic process as a random probability measure and achieve online learning in a Bayesian manner. Our approach integrates optimizing and dynamic learning. When dealing with model uncertainty, the nonparametric framework allows us to avoid model misspecification that usually occurs in other classical control methods. Then, we develop a numerical algorithm to handle the infinitely dimensional state space in this setup and utilizes Gaussian process surrogates to obtain a functional representation of the value function in the Bellman recursion. We also build separate surrogates for optimal control to eliminate repeated optimizations on out-of-sample paths and bring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Statistical Methods and Inference
