Deep Reinforcement Learning for Asset Allocation: Reward Clipping

Jiwon Kim; Moon-Ju Kang; KangHun Lee; HyungJun Moon and; Bo-Kwan Jeon

arXiv:2301.05300·q-fin.CP·January 16, 2023

Deep Reinforcement Learning for Asset Allocation: Reward Clipping

Jiwon Kim, Moon-Ju Kang, KangHun Lee, HyungJun Moon and, Bo-Kwan Jeon

PDF

Open Access

TL;DR

This paper compares various reinforcement learning algorithms for asset allocation, introduces a reward clipping model, and demonstrates its superior performance in both bullish and bearish market conditions.

Contribution

It introduces a reward clipping reinforcement learning model tailored for finance, showing improved stability and performance over existing methods in portfolio optimization.

Findings

01

Reward clipping model outperforms other RL models in finance.

02

The model maintains stability in both bull and bear markets.

03

Compared to traditional strategies, RL models show competitive results.

Abstract

Recently, there are many trials to apply reinforcement learning in asset allocation for earning more stable profits. In this paper, we compare performance between several reinforcement learning algorithms - actor-only, actor-critic and PPO models. Furthermore, we analyze each models' character and then introduce the advanced algorithm, so called Reward clipping model. It seems that the Reward Clipping model is better than other existing models in finance domain, especially portfolio optimization - it has strength both in bull and bear markets. Finally, we compare the performance for these models with traditional investment strategies during decreasing and increasing markets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFinancial Markets and Investment Strategies · Economic theories and models · Complex Systems and Time Series Analysis

MethodsEntropy Regularization · Proximal Policy Optimization