Mid-price Prediction Based on Machine Learning Methods with Technical and Quantitative Indicators
Adamantios Ntakaris, Juho Kanniainen, Moncef Gabbouj, Alexandros, Iosifidis

TL;DR
This paper explores machine learning techniques for short-term mid-price stock prediction using over 270 features derived from technical and quantitative analysis, emphasizing feature selection and online learning methods.
Contribution
It introduces a new adaptive logistic regression-based quantitative feature and evaluates feature selection methods for improved prediction accuracy.
Findings
Few selected features achieve optimal prediction performance.
The adaptive logistic regression feature outperforms other features.
Effective feature combination enhances mid-price movement prediction.
Abstract
Stock price prediction is a challenging task, but machine learning methods have recently been used successfully for this purpose. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical and quantitative analysis and tested their validity on short-term mid-price movement prediction. We focus on a wrapper feature selection method using entropy, least-mean squares, and linear discriminant analysis. We also build a new quantitative feature based on adaptive logistic regression for online learning, which is constantly selected first among the majority of the proposed feature selection methods. This study examines the best combination of features using high frequency limit order book data from Nasdaq Nordic. Our results suggest that sorting methods and classifiers can be used in such a way that one can reach the best performance with a combination of only very…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Complex Systems and Time Series Analysis · Innovation Diffusion and Forecasting
MethodsFeature Selection · Logistic Regression
