TabH2O: A Unified Foundation Model for Tabular Prediction
Pascal Pfeiffer, Dmitry Gordeev, Mathias M\"uller, Laura Fink, Joan Salv\`a Soler, Mark Landry, Branden Murray, Marcos V. Conde, and Sri Satish Ambati

TL;DR
TabH2O is a unified foundation model for tabular data that efficiently handles classification and regression tasks through in-context learning, with novel training techniques for robustness and stability.
Contribution
It introduces a single, dual-head model trained with a simplified, noise-aware pretraining process for improved tabular prediction performance.
Findings
Achieves top-3 performance on 81% of datasets in TALENT benchmark.
Outperforms traditional models like CatBoost, H2O AutoML, and LightGBM.
Competitive with state-of-the-art TabPFN v2, placing second in average rank.
Abstract
We present TabH2O, a foundation model for tabular data that performs classification and regression in a single forward pass via in-context learning. TabH2O builds on the TabICL architecture with several key modifications: (1) unified training, a single model handles both classification and regression via a dual-head architecture, eliminating the need for separate models and reducing total pretraining cost; (2) single-stage pretraining, training stability improvements (bounded scalable softmax, inter-stage normalization, learnable residual scaling, logit soft-capping) eliminate the need for multi-stage curriculum learning, enabling training with full-length sequences from the start; and (3) noise-aware pretraining, synthetic datasets include explicit noise dimensions to teach the model robustness to irrelevant features. We evaluate TabH2O v1 (29.2M parameters) on the TALENT benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
