TL;DR
xRFM is a novel feature learning model for tabular data that combines kernel machines with tree structures, achieving superior performance and interpretability over existing methods.
Contribution
It introduces xRFM, a scalable and interpretable model that outperforms 31 methods, including GBDTs and foundation models, across diverse regression and classification datasets.
Findings
xRFM outperforms 31 other methods on 100 regression datasets.
xRFM is competitive with top methods on 200 classification datasets.
xRFM provides native interpretability via the Average Gradient Outer Product.
Abstract
Inference from tabular data, collections of continuous and categorical variables organized into matrices, is a foundation for modern technology and science. Yet, in contrast to the explosive changes in the rest of AI, the best practice for these predictive tasks has been relatively unchanged and is still primarily based on variations of Gradient Boosted Decision Trees (GBDTs). Very recently, there has been renewed interest in developing state-of-the-art methods for tabular data based on recent developments in neural networks and feature learning methods. In this work, we introduce xRFM, an algorithm that combines feature learning kernel machines with a tree structure to both adapt to the local structure of the data and scale to essentially unlimited amounts of training data. We show that compared to other methods, including recently introduced tabular foundation models (TabPFNv2)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
