T2G-Former: Organizing Tabular Features into Relation Graphs Promotes Heterogeneous Feature Interaction
Jiahuan Yan, Jintai Chen, Yixuan Wu, Danny Z. Chen, Jian Wu

TL;DR
T2G-Former introduces a graph-based Transformer model for tabular data that automatically learns feature relations, promoting effective heterogeneous feature interaction and achieving state-of-the-art results.
Contribution
The paper proposes a novel Graph Estimator and a Transformer architecture tailored for tabular data, effectively modeling feature relations to enhance interaction.
Findings
T2G-Former outperforms existing DNNs on tabular tasks.
It achieves competitive results with gradient boosting models.
The method effectively captures heterogeneous feature relations.
Abstract
Recent development of deep neural networks (DNNs) for tabular learning has largely benefited from the capability of DNNs for automatic feature interaction. However, the heterogeneity nature of tabular features makes such features relatively independent, and developing effective methods to promote tabular feature interaction still remains an open problem. In this paper, we propose a novel Graph Estimator, which automatically estimates the relations among tabular features and builds graphs by assigning edges between related features. Such relation graphs organize independent tabular features into a kind of graph data such that interaction of nodes (tabular features) can be conducted in an orderly fashion. Based on our proposed Graph Estimator, we present a bespoke Transformer network tailored for tabular learning, called T2G-Former, which processes tabular data by performing tabular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks · Advanced Neural Network Applications · Explainable Artificial Intelligence (XAI)
MethodsAttention Is All You Need · Layer Normalization · Adam · Absolute Position Encodings · Softmax · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Label Smoothing · Linear Layer
