How informative is the Order Book Beyond the Best Levels? Machine Learning Perspective
Dat Thanh Tran, Juho Kanniainen, Alexandros Iosifidis

TL;DR
This study evaluates the predictive value of deeper order book levels beyond the best quotes using machine learning, finding that while top levels are most informative, incorporating all levels enhances prediction accuracy.
Contribution
It demonstrates that all order book levels contribute to price movement prediction, with deeper levels providing additional information beyond the best quotes.
Findings
Top levels are most informative for prediction.
Using all levels improves model performance.
Informativeness decreases from first to fourth level.
Abstract
Research on limit order book markets has been rapidly growing and nowadays high-frequency full order book data is widely available for researchers and practitioners. However, it is common that research papers use the best level data only, which motivates us to ask whether the exclusion of the quotes deeper in the book over multiple price levels causes performance degradation. In this paper, we address this question by using modern Machine Learning (ML) techniques to predict mid-price movements without assuming that limit order book markets represent a linear system. We provide a number of results that are robust across ML prediction models, feature selection algorithms, data sets, and prediction horizons. We find that the best bid and ask levels are systematically identified not only as the most informative levels in the order books, but also to carry most of the information needed for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Markets and Investment Strategies · Auction Theory and Applications · Corporate Finance and Governance
