TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning   Benchmarks

Ivan Rubachev; Nikolay Kartashev; Yury Gorishniy; Artem Babenko

arXiv:2406.19380·cs.LG·October 25, 2024·1 cites

TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks

Ivan Rubachev, Nikolay Kartashev, Yury Gorishniy, Artem Babenko

PDF

Open Access 2 Repos

TL;DR

This paper introduces TabReD, a collection of industry-grade tabular datasets, and analyzes how recent deep learning methods perform under real-world conditions like data drift and feature engineering, revealing different rankings than traditional benchmarks.

Contribution

The paper presents TabReD, a new benchmark dataset collection, and evaluates existing models under realistic industrial scenarios, highlighting the importance of time-based splits and feature considerations.

Findings

01

Time-based splits change model rankings.

02

Simple MLP and GBDT perform best on TabReD.

03

Current benchmarks may not reflect real-world challenges.

Abstract

Advances in machine learning research drive progress in real-world applications. To ensure this progress, it is important to understand the potential pitfalls on the way from a novel method's success on academic benchmarks to its practical deployment. In this work, we analyze existing tabular benchmarks and find two common characteristics of tabular data in typical industrial applications that are underrepresented in the datasets usually used for evaluation in the literature. First, in real-world deployment scenarios, distribution of data often changes over time. To account for this distribution drift, time-based train/test splits should be used in evaluation. However, popular tabular datasets often lack timestamp metadata to enable such evaluation. Second, a considerable portion of datasets in production settings stem from extensive data acquisition and feature engineering pipelines.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Machine Learning and Data Classification