TL;DR
This paper introduces three RL trading environments with realistic market impact models, demonstrating significant effects on trading behavior and algorithm performance, and provides an open-source toolkit for more accurate backtesting.
Contribution
It develops three new trading environments incorporating nonlinear market impact models, enabling more realistic RL trading research and evaluation.
Findings
Market impact models significantly alter algorithm performance and trading costs.
The Almgren-Chriss impact model reduces costs and turnover dramatically.
Hyperparameter tuning is crucial for realistic trading behavior and cost reduction.
Abstract
Reinforcement learning (RL) has shown promise for trading, yet most open-source backtesting environments assume negligible or fixed transaction costs, causing agents to learn trading behaviors that fail under realistic execution. We introduce three Gymnasium-compatible trading environments -- MACE (Market-Adjusted Cost Execution) stock trading, margin trading, and portfolio optimization -- that integrate nonlinear market impact models grounded in the Almgren-Chriss framework and the empirically validated square-root impact law. Each environment provides pluggable cost models, permanent impact tracking with exponential decay, and comprehensive trade-level logging. We evaluate five DRL algorithms (A2C, PPO, DDPG, SAC, TD3) on the NASDAQ-100, comparing a fixed 10 bps baseline against the AC model with Optuna-tuned hyperparameters. Our results show that (i) the cost model materially changes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
