A Survey on Deep Tabular Learning
Shriyank Somvanshi, Subasish Das, Syed Aaqib Javed, Gian Antariksa,, Ahmed Hossain

TL;DR
This survey reviews the evolution of deep learning models for tabular data, highlighting architectures like TabNet, SAINT, and hybrid models, and discusses recent advances in scalability, interpretability, and data augmentation techniques.
Contribution
It provides a comprehensive overview of recent deep learning architectures for tabular data, including novel models and future research directions.
Findings
Attention mechanisms improve interpretability and scalability.
Hybrid models effectively handle heterogeneous data types.
Diffusion models and transfer learning enhance robustness and data efficiency.
Abstract
Tabular data, widely used in industries like healthcare, finance, and transportation, presents unique challenges for deep learning due to its heterogeneous nature and lack of spatial structure. This survey reviews the evolution of deep learning models for tabular data, from early fully connected networks (FCNs) to advanced architectures like TabNet, SAINT, TabTranSELU, and MambaNet. These models incorporate attention mechanisms, feature embeddings, and hybrid architectures to address tabular data complexities. TabNet uses sequential attention for instance-wise feature selection, improving interpretability, while SAINT combines self-attention and intersample attention to capture complex interactions across features and data points, both advancing scalability and reducing computational overhead. Hybrid architectures such as TabTransformer and FT-Transformer integrate attention mechanisms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization
MethodsGated Linear Unit · Attention Is All You Need · Gated Adaptive Network for Deep Automated Learning of Features · Linear Layer · Softmax · Multi-Head Attention · Layer Normalization · Batch Normalization · Dense Connections · Residual Connection
