A Mechanistic Study of Tabular Foundation Models

Marin Bilo\v{s}; James T. Wilson; Anderson Schneider; Yuriy Nevmyvaka

arXiv:2605.21288·cs.LG·May 21, 2026

A Mechanistic Study of Tabular Foundation Models

Marin Bilo\v{s}, James T. Wilson, Anderson Schneider, Yuriy Nevmyvaka

PDF

TL;DR

This paper provides a detailed mechanistic analysis of tabular foundation models, revealing their in-context algorithms, invariances, and robustness characteristics through causal interventions and perturbation experiments.

Contribution

It offers the first comprehensive mechanistic understanding of how different tabular models operate, including their similarity-based readouts and invariance sources.

Findings

01

Models realize distinct similarity-based readouts confirmed by causal intervention.

02

Permutation invariances originate from specific positional parameters, removal preserves accuracy.

03

Engineered perturbations reproduce predicted failure modes and isolate model vulnerabilities.

Abstract

Tabular foundation models with different architectures converge in accuracy across a range of classification and regression tasks. This raises questions a leaderboard cannot answer: (i) whether the models execute the same in-context algorithm, (ii) where row, column, and class-permutation invariances originate, and (iii) how robust they are under perturbations engineered against the inferred mechanism. We characterize all three. The model families realize qualitatively distinct similarity-based readouts: from an attention-weighted vote over context labels to a class-conditional mean readout, each confirmed by causal intervention. We find that the representation collapse highlighted in prior work is not a practical concern for them. Each model's permutation invariances trace to specific positional parameters whose removal preserves accuracy and makes approximate invariance exact.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.