Comparing Cluster-Based Cross-Validation Strategies for Machine Learning Model Evaluation
Afonso Martini Spezia, Thomas Fontanari, Mariana Recamonde-Mendoza

TL;DR
This paper evaluates cluster-based cross-validation strategies for machine learning, proposing a new method combining Mini Batch K-Means with class stratification, and compares their effectiveness across diverse datasets and algorithms.
Contribution
It introduces a novel cross-validation technique using Mini Batch K-Means with class stratification and provides an extensive experimental comparison of clustering algorithms for data splitting.
Findings
Mini Batch K-Means with class stratification reduces bias and variance on balanced datasets.
Traditional stratified cross-validation performs best on imbalanced datasets.
No clustering algorithm consistently outperforms others across all scenarios.
Abstract
Cross-validation plays a fundamental role in Machine Learning, enabling robust evaluation of model performance and preventing overestimation on training and validation data. However, one of its drawbacks is the potential to create data subsets (folds) that do not adequately represent the diversity of the original dataset, which can lead to biased performance estimates. The objective of this work is to deepen the investigation of cluster-based cross-validation strategies by analyzing the performance of different clustering algorithms through experimental comparison. Additionally, a new cross-validation technique that combines Mini Batch K-Means with class stratification is proposed. Experiments were conducted on 20 datasets (both balanced and imbalanced) using four supervised learning algorithms, comparing cross-validation strategies in terms of bias, variance, and computational cost.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
