Deep Policy Gradient Methods in Commodity Markets
Jonas Hanetho

TL;DR
This paper explores the application of deep policy gradient reinforcement learning methods to commodities trading, demonstrating significant improvements in risk-adjusted returns and adaptability to market volatility.
Contribution
It introduces novel deep reinforcement learning algorithms with adaptive time discretization for commodities trading, outperforming traditional models in backtests.
Findings
Deep RL models achieve 83% higher Sharpe ratio than buy-and-hold.
Actor-based algorithms outperform actor-critic ones.
CNN architectures slightly outperform LSTM models.
Abstract
The energy transition has increased the reliance on intermittent energy sources, destabilizing energy markets and causing unprecedented volatility, culminating in the global energy crisis of 2021. In addition to harming producers and consumers, volatile energy markets may jeopardize vital decarbonization efforts. Traders play an important role in stabilizing markets by providing liquidity and reducing volatility. Several mathematical and statistical models have been proposed for forecasting future returns. However, developing such models is non-trivial due to financial markets' low signal-to-noise ratios and nonstationary dynamics. This thesis investigates the effectiveness of deep reinforcement learning methods in commodities trading. It formalizes the commodities trading problem as a continuing discrete-time stochastic dynamical system. This system employs a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarket Dynamics and Volatility · Energy Load and Power Forecasting · Stock Market Forecasting Methods
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
