Distribution Shift Aware Neural Tabular Learning

Wangyang Ying; Nanxu Gong; Dongjie Wang; Xinyuan Wang; Arun Vignesh Malarkkan; Vivek Gupta; Chandan K. Reddy; Yanjie Fu

arXiv:2508.19486·cs.LG·August 28, 2025

Distribution Shift Aware Neural Tabular Learning

Wangyang Ying, Nanxu Gong, Dongjie Wang, Xinyuan Wang, Arun Vignesh Malarkkan, Vivek Gupta, Chandan K. Reddy, Yanjie Fu

PDF

TL;DR

This paper introduces SAFT, a novel framework for improving neural tabular learning robustness under distribution shifts by transforming features into a continuous, optimizable space, leading to better generalization.

Contribution

The paper formalizes the DSTL problem and proposes SAFT, a shift-aware feature transformation method that enhances robustness and generalization in tabular learning under distribution shifts.

Findings

01

SAFT outperforms prior methods under diverse distribution shifts.

02

SAFT improves robustness and generalization in real-world scenarios.

03

Extensive experiments validate the effectiveness of SAFT.

Abstract

Tabular learning transforms raw features into optimized spaces for downstream tasks, but its effectiveness deteriorates under distribution shifts between training and testing data. We formalize this challenge as the Distribution Shift Tabular Learning (DSTL) problem and propose a novel Shift-Aware Feature Transformation (SAFT) framework to address it. SAFT reframes tabular learning from a discrete search task into a continuous representation-generation paradigm, enabling differentiable optimization over transformed feature sets. SAFT integrates three mechanisms to ensure robustness: (i) shift-resistant representation via embedding decorrelation and sample reweighting, (ii) flatness-aware generation through suboptimal embedding averaging, and (iii) normalization-based alignment between training and test distributions. Extensive experiments show that SAFT consistently outperforms prior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.