TL;DR
This paper extends and validates the Limit Order Book Recreation Model (LOBRM) using larger, chronologically ordered datasets, improving its accuracy and efficiency for real-world market microstructure applications.
Contribution
The study enhances LOBRM with new standardization and kernel methods, and validates its effectiveness on extended datasets in realistic scenarios.
Findings
LOBRM with decay kernel outperforms traditional models
Prediction accuracy decreases with higher order volume volatility
Sparse encoding of TAQ data shows strong generalization
Abstract
The limit order book (LOB) depicts the fine-grained demand and supply relationship for financial assets and is widely used in market microstructure studies. Nevertheless, the availability and high cost of LOB data restrict its wider application. The LOB recreation model (LOBRM) was recently proposed to bridge this gap by synthesizing the LOB from trades and quotes (TAQ) data. However, in the original LOBRM study, there were two limitations: (1) experiments were conducted on a relatively small dataset containing only one day of LOB data; and (2) the training and testing were performed in a non-chronological fashion, which essentially re-frames the task as interpolation and potentially introduces lookahead bias. In this study, we extend the research on LOBRM and further validate its use in real-world application scenarios. We first advance the workflow of LOBRM by (1) adding a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsExponential Decay
