Selecting Feature Interactions for Generalized Additive Models by Distilling Foundation Models

Jingyun Jia; Chandan Singh; Rich Caruana; Ben Lengerich

arXiv:2604.13332·cs.LG·April 16, 2026

Selecting Feature Interactions for Generalized Additive Models by Distilling Foundation Models

Jingyun Jia, Chandan Singh, Rich Caruana, Ben Lengerich

PDF

TL;DR

This paper introduces TabDistill, a method that uses foundation models and post-hoc attribution to identify feature interactions, improving GAMs' accuracy and interpretability for tabular data.

Contribution

It proposes a novel approach leveraging foundation models for automatic feature interaction discovery in GAMs, enhancing their predictive performance.

Findings

01

Interactions identified by TabDistill improve GAMs' predictive accuracy.

02

TabDistill effectively captures higher-order and context-dependent feature effects.

03

The method bridges high-capacity models and interpretable additive models.

Abstract

Identifying meaningful feature interactions is a central challenge in building accurate and interpretable models for tabular data. Generalized additive models (GAMs) have shown great success at modeling tabular data, but often rely on heuristic procedures to select interactions, potentially missing higher-order or context-dependent effects. To meet this challenge, we propose TabDistill, a method that leverages tabular foundation models and post-hoc distillation methods. Our key intuition is that tabular foundation models implicitly learn rich, adaptive feature dependencies through large-scale representation learning. Given a dataset, TabDistill first fits a tabular foundation model to the dataset, and then applies a post-hoc interaction attribution method to extract salient feature interactions from it. We evaluate these interactions by then using them as terms in a GAM. Across tasks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.