Irredundant $k$-Fold Cross-Validation

Jesus S. Aguilar-Ruiz

arXiv:2507.20048·cs.LG·August 29, 2025

Irredundant $k$-Fold Cross-Validation

Jesus S. Aguilar-Ruiz

PDF

TL;DR

Irredundant $k$-fold cross-validation is a new method that uses each dataset instance exactly once for training and testing, reducing redundancy, overfitting, and computational costs while maintaining reliable performance estimates.

Contribution

The paper introduces a novel irredundant $k$-fold cross-validation method that ensures each instance is used exactly once for training and testing, improving dataset utilization and analysis accuracy.

Findings

01

Provides consistent performance estimates across datasets

02

Reduces variance in model evaluation

03

Lowers computational costs compared to traditional methods

Abstract

In traditional k-fold cross-validation, each instance is used ( $k - 1$ ) times for training and once for testing, leading to redundancy that lets many instances disproportionately influence the learning phase. We introduce Irredundant $k$ -fold cross-validation, a novel method that guarantees each instance is used exactly once for training and once for testing across the entire validation procedure. This approach ensures a more balanced utilization of the dataset, mitigates overfitting due to instance repetition, and enables sharper distinctions in comparative model analysis. The method preserves stratification and remains model-agnostic, i.e., compatible with any classifier. Experimental results demonstrate that it delivers consistent performance estimates across diverse datasets -- comparable to $k$ -fold cross-validation -- while providing less optimistic variance estimates because…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.