A Modular Framework for Reinforcement Learning Optimal Execution
Fernando de Meer Pardo, Christoph Auth, Florin Dascalu

TL;DR
This paper presents a flexible, modular framework for applying Reinforcement Learning to optimal trade execution, emphasizing environment simulation, component interactions, and realistic evaluation procedures.
Contribution
It introduces a modular environment design for RL-based trade execution, enabling flexible simulation setups and realistic evaluation methods.
Findings
Demonstrated a setup using limit orders and TWAP benchmarks
Highlighted the divergence between simulation and real market behavior
Developed evaluation procedures with iterative re-training
Abstract
In this article, we develop a modular framework for the application of Reinforcement Learning to the problem of Optimal Trade Execution. The framework is designed with flexibility in mind, in order to ease the implementation of different simulation setups. Rather than focusing on agents and optimization methods, we focus on the environment and break down the necessary requirements to simulate an Optimal Trade Execution under a Reinforcement Learning framework such as data pre-processing, construction of observations, action processing, child order execution, simulation of benchmarks, reward calculations etc. We give examples of each component, explore the difficulties their individual implementations \& the interactions between them entail, and discuss the different phenomena that each component induces in the simulation, highlighting the divergences between the simulation and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Reinforcement Learning in Robotics · Supply Chain and Inventory Management
