Revisiting Pretraining Objectives for Tabular Deep Learning

Ivan Rubachev; Artem Alekberov; Yury Gorishniy; Artem Babenko

arXiv:2207.03208·cs.LG·July 13, 2022·6 cites

Revisiting Pretraining Objectives for Tabular Deep Learning

Ivan Rubachev, Artem Alekberov, Yury Gorishniy, Artem Babenko

PDF

Open Access 2 Repos

TL;DR

This paper investigates various pretraining strategies for tabular deep learning models, demonstrating that proper pretraining, especially with target-aware objectives, can significantly enhance performance and surpass traditional GBDT models.

Contribution

It identifies effective pretraining practices for tabular DL models, highlighting the benefits of target-aware objectives and providing comprehensive comparisons across architectures.

Findings

01

Pretraining with target labels improves downstream performance.

02

Proper pretraining can make tabular DL models outperform GBDTs.

03

Target-aware pretraining objectives are particularly beneficial.

Abstract

Recent deep learning models for tabular data currently compete with the traditional ML models based on decision trees (GBDT). Unlike GBDT, deep models can additionally benefit from pretraining, which is a workhorse of DL for vision and NLP. For tabular problems, several pretraining methods were proposed, but it is not entirely clear if pretraining provides consistent noticeable improvements and what method should be used, since the methods are often not compared to each other or comparison is limited to the simplest MLP architectures. In this work, we aim to identify the best practices to pretrain tabular DL models that can be universally applied to different datasets and architectures. Among our findings, we show that using the object target labels during the pretraining stage is beneficial for the downstream performance and advocate several target-aware pretraining objectives.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)