Reinforcement Learning in Non-Markov Market-Making

Luca Lalor; Anatoliy Swishchuk

arXiv:2410.14504·q-fin.CP·March 3, 2025

Reinforcement Learning in Non-Markov Market-Making

Luca Lalor, Anatoliy Swishchuk

PDF

Open Access

TL;DR

This paper introduces a deep reinforcement learning framework using the Soft Actor-Critic algorithm for optimal market-making in complex price environments with semi-Markov and Hawkes jump-diffusion dynamics, demonstrating its effectiveness through detailed simulations.

Contribution

It develops a novel RL approach tailored for non-Markovian market dynamics, integrating advanced jump-diffusion models and addressing adverse selection in market-making strategies.

Findings

01

RL framework effectively models complex price dynamics

02

Deep RL outperforms traditional methods in simulated environments

03

Insights into the evolution of trading variables and reward functions

Abstract

We develop a deep reinforcement learning (RL) framework for an optimal market-making (MM) trading problem, specifically focusing on price processes with semi-Markov and Hawkes Jump-Diffusion dynamics. We begin by discussing the basics of RL and the deep RL framework used, where we deployed the state-of-the-art Soft Actor-Critic (SAC) algorithm for the deep learning part. The SAC algorithm is an off-policy entropy maximization algorithm more suitable for tackling complex, high-dimensional problems with continuous state and action spaces like in optimal market-making (MM). We introduce the optimal MM problem considered, where we detail all the deterministic and stochastic processes that go into setting up an environment for simulating this strategy. Here we also give an in-depth overview of the jump-diffusion pricing dynamics used, our method for dealing with adverse selection within the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuction Theory and Applications · Supply Chain and Inventory Management

MethodsAverage Pooling · Dilated Convolution · Convolution · Global Average Pooling · 1x1 Convolution · Diffusion · Switchable Atrous Convolution