Sensitivity Assessing to Data Volume for forecasting: introducing similarity methods as a suitable one in Feature selection methods
Mahdi Goldani, Soraya Asadi Tirvan

TL;DR
This paper evaluates the stability of feature selection methods in time series forecasting, highlighting the effectiveness of similarity-based methods like Hausdorff and edit distance in maintaining robustness across varying data volumes.
Contribution
It introduces the use of time series similarity methods for feature selection, demonstrating their minimal sensitivity to data volume changes in financial forecasting.
Findings
Variance, edit distance, and Hausdorff methods show low sensitivity to data volume.
Similarity-based feature selection methods are effective for robust forecasting.
Results support using these methods in dynamic, high-dimensional datasets.
Abstract
In predictive modeling, overfitting poses a significant risk, particularly when the feature count surpasses the number of observations, a common scenario in high-dimensional data sets. To mitigate this risk, feature selection is employed to enhance model generalizability by reducing the dimensionality of the data. This study focuses on evaluating the stability of feature selection techniques with respect to varying data volumes, particularly employing time series similarity methods. Utilizing a comprehensive dataset that includes the closing, opening, high, and low prices of stocks from 100 high-income companies listed in the Fortune Global 500, this research compares several feature selection methods including variance thresholds, edit distance, and Hausdorff distance metrics. The aim is to identify methods that show minimal sensitivity to the quantity of data, ensuring robustness and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications
MethodsFeature Selection
