Commodities Trading through Deep Policy Gradient Methods
Jonas Hanetho

TL;DR
This paper explores deep reinforcement learning for commodities trading, introducing novel algorithms and time-discretization schemes that adapt to market volatility, resulting in significantly improved trading performance and customizable risk profiles.
Contribution
It presents new DRL algorithms with adaptive time discretization and demonstrates their effectiveness in commodities trading, outperforming traditional methods.
Findings
DRL models increase Sharpe ratio by 83% over buy-and-hold.
Actor-based models outperform actor-critic models.
CNN-based models slightly outperform LSTM-based models.
Abstract
Algorithmic trading has gained attention due to its potential for generating superior returns. This paper investigates the effectiveness of deep reinforcement learning (DRL) methods in algorithmic commodities trading. It formulates the commodities trading problem as a continuous, discrete-time stochastic dynamical system. The proposed system employs a novel time-discretization scheme that adapts to market volatility, enhancing the statistical properties of subsampled financial time series. To optimize transaction-cost- and risk-sensitive trading agents, two policy gradient algorithms, namely actor-based and actor-critic-based approaches, are introduced. These agents utilize CNNs and LSTMs as parametric function approximators to map historical price observations to market positions.Backtesting on front-month natural gas futures demonstrates that DRL models increase the Sharpe ratio by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarket Dynamics and Volatility · Stock Market Forecasting Methods · Energy Load and Power Forecasting
