Generative AI for End-to-End Limit Order Book Modelling: A Token-Level Autoregressive Generative Model of Message Flow Using a Deep State Space Network
Peer Nagy, Sascha Frey, Silvia Sapora, Kang Li, Anisoara Calinescu,, Stefan Zohren, Jakob Foerster

TL;DR
This paper introduces a novel autoregressive generative model for tokenized limit order book messages, demonstrating promising data distribution approximation and potential applications in high-frequency financial modeling.
Contribution
It presents the first end-to-end autoregressive model for generating tokenized LOB messages using structured state-space layers and a custom tokenizer, advancing financial data simulation.
Findings
Low model perplexity indicates good data distribution fit.
Generated mid-price returns correlate well with real data.
Model enables future high-frequency financial applications.
Abstract
Developing a generative model of realistic order flow in financial markets is a challenging open problem, with numerous applications for market participants. Addressing this, we propose the first end-to-end autoregressive generative model that generates tokenized limit order book (LOB) messages. These messages are interpreted by a Jax-LOB simulator, which updates the LOB state. To handle long sequences efficiently, the model employs simplified structured state-space layers to process sequences of order book states and tokenized messages. Using LOBSTER data of NASDAQ equity LOBs, we develop a custom tokenizer for message data, converting groups of successive digits to tokens, similar to tokenization in large language models. Out-of-sample results show promising performance in approximating the data distribution, as evidenced by low model perplexity. Furthermore, the mid-price returns…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Markets and Investment Strategies · Stock Market Forecasting Methods · Complex Systems and Time Series Analysis
