Model Based Reinforcement Learning with Non-Gaussian Environment Dynamics and its Application to Portfolio Optimization
Huifang Huang, Ting Gao, Pengbo Li, Jin Guo, Peng Zhang, and Nan Du

TL;DR
This paper introduces a model-based reinforcement learning approach using heavy-tailed normalizing flows to simulate complex financial environments, improving portfolio optimization and robustness during crises.
Contribution
It develops a novel RL framework with heavy-tailed environment modeling and applies it to multi-market stock trading, demonstrating improved performance and crisis mitigation.
Findings
Dow market achieved the best performance among tested markets.
The method reduced maximum drawdown during COVID-19 crisis.
Visualization and analysis revealed effective portfolio strategies and convergence properties.
Abstract
With the fast development of quantitative portfolio optimization in financial engineering, lots of AI-based algorithmic trading strategies have demonstrated promising results, among which reinforcement learning begins to manifest competitive advantages. However, the environment from real financial markets is complex and hard to be fully simulated, considering the observation of abrupt transitions, unpredictable hidden causal factors, heavy tail properties and so on. Thus, in this paper, first, we adopt a heavy-tailed preserving normalizing flows to simulate high-dimensional joint probability of the complex trading environment and develop a model-based reinforcement learning framework to better understand the intrinsic mechanisms of quantitative online trading. Second, we experiment with various stocks from three different financial markets (Dow, NASDAQ and S&P) and show that among these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Financial Markets and Investment Strategies · Advanced Bandit Algorithms Research
MethodsNormalizing Flows
