SimLOB: Learning Representations of Limited Order Book for Financial Market Simulation
Yuanzhe Li, Yue Wu, Muyao Zhong, Shengcai Liu, Peng Yang

TL;DR
This paper introduces a Transformer-based autoencoder to learn vectorized representations of Limit Order Book data, enabling high-fidelity financial market simulations by capturing detailed market micro-structure information.
Contribution
It presents the first method to utilize LOB data for market simulation calibration through learned vectorized representations, improving fidelity over previous mid-price based approaches.
Findings
Latent LOB representations preserve non-linear auto-correlation.
Representations capture precedence between price levels.
Performance is consistent across calibration tasks.
Abstract
Financial market simulation (FMS) serves as a promising tool for understanding market anomalies and the underlying trading behaviors. To ensure high-fidelity simulations, it is crucial to calibrate the FMS model for generating data closely resembling the observed market data. Previous efforts primarily focused on calibrating the mid-price data, leading to essential information loss of the market activities and thus biasing the calibrated model. The Limit Order Book (LOB) data is the fundamental data fully capturing the market micro-structure and is adopted by worldwide exchanges. However, LOB is not applicable to existing calibration objective functions due to its tabular structure not suitable for the vectorized input requirement. This paper proposes to explicitly learn the vectorized representations of LOB with a Transformer-based autoencoder. Then the latent vector, which captures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Mathematics, Computing, and Information Processing · Complex Systems and Time Series Analysis
