TILBench: A Systematic Benchmark for Tabular Imbalanced Learning Across Data Regimes

Ruizhe Liu; Jiaqi Luo

arXiv:2605.14915·cs.LG·May 15, 2026

TILBench: A Systematic Benchmark for Tabular Imbalanced Learning Across Data Regimes

Ruizhe Liu, Jiaqi Luo

PDF

TL;DR

TILBench is a comprehensive benchmark evaluating over 40 algorithms across 57 datasets to understand their performance, robustness, and scalability in tabular imbalanced learning.

Contribution

It introduces TILBench, a large-scale empirical benchmark providing systematic comparisons of imbalanced learning methods across diverse data regimes.

Findings

01

No single method dominates across all settings.

02

Method effectiveness depends on dataset characteristics and computational constraints.

03

Practical recommendations are provided for method selection.

Abstract

Imbalanced learning remains a fundamental challenge in tabular data applications. Despite decades of research and numerous proposed algorithms, a systematic empirical understanding of how different imbalanced learning methods behave across diverse data characteristics is still lacking. In particular, it remains unclear how different method families compare in predictive performance, robustness under varying data characteristics, and computational scalability. In this work, we present Tabular Imbalanced Learning Benchmark (TILBench), a large-scale empirical benchmark for tabular imbalanced learning. TILBench evaluates more than 40 representative algorithms across 57 diverse tabular datasets, resulting in over 200000 controlled experiments across a wide range of data characteristics. Our findings show that no single method consistently dominates across all settings; instead, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.