TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks
Ivan Rubachev, Nikolay Kartashev, Yury Gorishniy, Artem Babenko

TL;DR
This paper introduces TabReD, a collection of industry-grade tabular datasets, and analyzes how recent deep learning methods perform under real-world conditions like data drift and feature engineering, revealing different rankings than traditional benchmarks.
Contribution
The paper presents TabReD, a new benchmark dataset collection, and evaluates existing models under realistic industrial scenarios, highlighting the importance of time-based splits and feature considerations.
Findings
Time-based splits change model rankings.
Simple MLP and GBDT perform best on TabReD.
Current benchmarks may not reflect real-world challenges.
Abstract
Advances in machine learning research drive progress in real-world applications. To ensure this progress, it is important to understand the potential pitfalls on the way from a novel method's success on academic benchmarks to its practical deployment. In this work, we analyze existing tabular benchmarks and find two common characteristics of tabular data in typical industrial applications that are underrepresented in the datasets usually used for evaluation in the literature. First, in real-world deployment scenarios, distribution of data often changes over time. To account for this distribution drift, time-based train/test splits should be used in evaluation. However, popular tabular datasets often lack timestamp metadata to enable such evaluation. Second, a considerable portion of datasets in production settings stem from extensive data acquisition and feature engineering pipelines.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Machine Learning and Data Classification
