Deep Policy Gradient Methods in Commodity Markets

Jonas Hanetho

arXiv:2308.01910·q-fin.TR·August 7, 2023·1 cites

Deep Policy Gradient Methods in Commodity Markets

Jonas Hanetho

PDF

Open Access

TL;DR

This paper explores the application of deep policy gradient reinforcement learning methods to commodities trading, demonstrating significant improvements in risk-adjusted returns and adaptability to market volatility.

Contribution

It introduces novel deep reinforcement learning algorithms with adaptive time discretization for commodities trading, outperforming traditional models in backtests.

Findings

01

Deep RL models achieve 83% higher Sharpe ratio than buy-and-hold.

02

Actor-based algorithms outperform actor-critic ones.

03

CNN architectures slightly outperform LSTM models.

Abstract

The energy transition has increased the reliance on intermittent energy sources, destabilizing energy markets and causing unprecedented volatility, culminating in the global energy crisis of 2021. In addition to harming producers and consumers, volatile energy markets may jeopardize vital decarbonization efforts. Traders play an important role in stabilizing markets by providing liquidity and reducing volatility. Several mathematical and statistical models have been proposed for forecasting future returns. However, developing such models is non-trivial due to financial markets' low signal-to-noise ratios and nonstationary dynamics. This thesis investigates the effectiveness of deep reinforcement learning methods in commodities trading. It formalizes the commodities trading problem as a continuing discrete-time stochastic dynamical system. This system employs a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarket Dynamics and Volatility · Energy Load and Power Forecasting · Stock Market Forecasting Methods

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory