The LOB Recreation Model: Predicting the Limit Order Book from TAQ History Using an Ordinary Differential Equation Recurrent Neural Network
Zijian Shi, Yu Chen, John Cartlidge

TL;DR
This paper introduces a deep learning model that accurately recreates the top five levels of the limit order book for small-tick stocks using only TAQ data, combining a history compiler, a market events simulator, and an adaptive weighting scheme.
Contribution
The paper presents the first deep learning approach to reconstruct the LOB from TAQ data, utilizing an ODE-RNN and transfer learning for cross-asset application.
Findings
High accuracy in LOB recreation demonstrated on real datasets
Model effectively predicts deep LOB volumes using TAQ data
Transfer learning enables efficient adaptation to different stocks
Abstract
In an order-driven financial market, the price of a financial asset is discovered through the interaction of orders - requests to buy or sell at a particular price - that are posted to the public limit order book (LOB). Therefore, LOB data is extremely valuable for modelling market dynamics. However, LOB data is not freely accessible, which poses a challenge to market participants and researchers wishing to exploit this information. Fortunately, trades and quotes (TAQ) data - orders arriving at the top of the LOB, and trades executing in the market - are more readily available. In this paper, we present the LOB recreation model, a first attempt from a deep learning perspective to recreate the top five price levels of the LOB for small-tick stocks using only TAQ data. Volumes of orders sitting deep in the LOB are predicted by combining outputs from: (1) a history compiler that uses a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsStock Market Forecasting Methods · Financial Markets and Investment Strategies · Complex Systems and Time Series Analysis
