Hybrid LSTM and PPO Networks for Dynamic Portfolio Optimization
Jun Kevin, Pujianto Yugopuspito

TL;DR
This paper presents a hybrid deep learning framework combining LSTM forecasting and PPO reinforcement learning to optimize portfolios dynamically, demonstrating improved performance across diverse financial assets and market conditions.
Contribution
The paper introduces a novel hybrid LSTM-PPO model that integrates time-series forecasting with adaptive reinforcement learning for portfolio management, outperforming traditional methods.
Findings
Higher annualized returns compared to baselines
Greater resilience in volatile market regimes
Improved risk-adjusted performance metrics
Abstract
This paper introduces a hybrid framework for portfolio optimization that fuses Long Short-Term Memory (LSTM) forecasting with a Proximal Policy Optimization (PPO) reinforcement learning strategy. The proposed system leverages the predictive power of deep recurrent networks to capture temporal dependencies, while the PPO agent adaptively refines portfolio allocations in continuous action spaces, allowing the system to anticipate trends while adjusting dynamically to market shifts. Using multi-asset datasets covering U.S. and Indonesian equities, U.S. Treasuries, and major cryptocurrencies from January 2018 to December 2024, the model is evaluated against several baselines, including equal-weight, index-style, and single-model variants (LSTM-only and PPO-only). The framework's performance is benchmarked against equal-weighted, index-based, and single-model approaches (LSTM-only and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Advanced Bandit Algorithms Research · Risk and Portfolio Optimization
