EM Algorithm and Stochastic Control in Economics
Steven Kou, Xianhua Peng, Xingbo Xu

TL;DR
This paper introduces the EM-Control algorithm, extending the classical EM method to multi-period stochastic control problems, using forward-backward Monte Carlo simulation for policy optimization with proven convergence.
Contribution
It proposes a novel EM-Control algorithm that applies EM principles to stochastic control, enabling efficient policy updates in complex multi-period problems.
Findings
Effective in monopoly pricing of perishable assets
Successfully applied to real business cycle analysis
Exhibits monotonic performance improvement
Abstract
Generalising the idea of the classical EM algorithm that is widely used for computing maximum likelihood estimates, we propose an EM-Control (EM-C) algorithm for solving multi-period finite time horizon stochastic control problems. The new algorithm sequentially updates the control policies in each time period using Monte Carlo simulation in a forward-backward manner; in other words, the algorithm goes forward in simulation and backward in optimization in each iteration. Similar to the EM algorithm, the EM-C algorithm has the monotonicity of performance improvement in each iteration, leading to good convergence properties. We demonstrate the effectiveness of the algorithm by solving stochastic control problems in the monopoly pricing of perishable assets and in the study of real business cycle.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
