Model-Free Reinforcement Learning for Asset Allocation
Adebayo Oshingbesan, Eniola Ajiboye, Peruth Kamashazi, Timothy Mbaka

TL;DR
This paper demonstrates that model-free deep reinforcement learning agents, especially on-policy actor-critic types, can significantly outperform traditional baselines in asset allocation tasks using real-world stock data.
Contribution
It provides empirical evidence that RL agents, particularly on-policy actor-critic models, can effectively improve portfolio management over classical methods.
Findings
RL agents outperform baseline strategies.
Actor-critic on-policy agents perform best.
No significant difference between value-based and policy-based RL.
Abstract
Asset allocation (or portfolio management) is the task of determining how to optimally allocate funds of a finite budget into a range of financial instruments/assets such as stocks. This study investigated the performance of reinforcement learning (RL) when applied to portfolio management using model-free deep RL agents. We trained several RL agents on real-world stock prices to learn how to perform asset allocation. We compared the performance of these RL agents against some baseline agents. We also compared the RL agents among themselves to understand which classes of agents performed better. From our analysis, RL agents can perform the task of portfolio management since they significantly outperformed two of the baseline agents (random allocation and uniform allocation). Four RL agents (A2C, SAC, PPO, and TRPO) outperformed the best baseline, MPT, overall. This shows the abilities of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Markets and Investment Strategies · Stock Market Forecasting Methods · Economic theories and models
Methods1x1 Convolution · Entropy Regularization · Convolution · Dilated Convolution · Proximal Policy Optimization · Global Average Pooling · Average Pooling · Switchable Atrous Convolution
