Risk-Aware Deep Reinforcement Learning for Dynamic Portfolio Optimization

Emmanuel Lwele; Sabuni Emmanuel; and Sitali Gabriel Sitali

arXiv:2511.11481·q-fin.PM·November 17, 2025

Risk-Aware Deep Reinforcement Learning for Dynamic Portfolio Optimization

Emmanuel Lwele, Sabuni Emmanuel, and Sitali Gabriel Sitali

PDF

Open Access

TL;DR

This paper develops a deep reinforcement learning framework for portfolio optimization that incorporates risk measures like drawdown and volatility, aiming to improve risk management in asset allocation.

Contribution

It introduces a novel DRL approach with risk-aware reward functions and direct risk constraints, advancing the application of RL in financial portfolio management.

Findings

01

DRL stabilizes portfolio volatility effectively.

02

Risk-adjusted returns decrease due to conservative policies.

03

Highlights the importance of reward shaping for better performance.

Abstract

This paper presents a deep reinforcement learning (DRL) framework for dynamic portfolio optimization under market uncertainty and risk. The proposed model integrates a Sharpe ratio-based reward function with direct risk control mechanisms, including maximum drawdown and volatility constraints. Proximal Policy Optimization (PPO) is employed to learn adaptive asset allocation strategies over historical financial time series. Model performance is benchmarked against mean-variance and equal-weight portfolio strategies using backtesting on high-performing equities. Results indicate that the DRL agent stabilizes volatility successfully but suffers from degraded risk-adjusted returns due to over-conservative policy convergence, highlighting the challenge of balancing exploration, return maximization, and risk mitigation. The study underscores the need for improved reward shaping and hybrid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Stock Market Forecasting Methods · Advanced Bandit Algorithms Research