Hybrid LSTM and PPO Networks for Dynamic Portfolio Optimization

Jun Kevin; Pujianto Yugopuspito

arXiv:2511.17963·cs.LG·November 25, 2025

Hybrid LSTM and PPO Networks for Dynamic Portfolio Optimization

Jun Kevin, Pujianto Yugopuspito

PDF

Open Access

TL;DR

This paper presents a hybrid deep learning framework combining LSTM forecasting and PPO reinforcement learning to optimize portfolios dynamically, demonstrating improved performance across diverse financial assets and market conditions.

Contribution

The paper introduces a novel hybrid LSTM-PPO model that integrates time-series forecasting with adaptive reinforcement learning for portfolio management, outperforming traditional methods.

Findings

01

Higher annualized returns compared to baselines

02

Greater resilience in volatile market regimes

03

Improved risk-adjusted performance metrics

Abstract

This paper introduces a hybrid framework for portfolio optimization that fuses Long Short-Term Memory (LSTM) forecasting with a Proximal Policy Optimization (PPO) reinforcement learning strategy. The proposed system leverages the predictive power of deep recurrent networks to capture temporal dependencies, while the PPO agent adaptively refines portfolio allocations in continuous action spaces, allowing the system to anticipate trends while adjusting dynamically to market shifts. Using multi-asset datasets covering U.S. and Indonesian equities, U.S. Treasuries, and major cryptocurrencies from January 2018 to December 2024, the model is evaluated against several baselines, including equal-weight, index-style, and single-model variants (LSTM-only and PPO-only). The framework's performance is benchmarked against equal-weighted, index-based, and single-model approaches (LSTM-only and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStock Market Forecasting Methods · Advanced Bandit Algorithms Research · Risk and Portfolio Optimization