TabSeq: A Framework for Deep Learning on Tabular Data via Sequential Ordering

Al Zadid Sultan Bin Habib; Kesheng Wang; Mary-Anne Hartley; Gianfranco Doretto; Donald A. Adjeroh

arXiv:2410.13203 · cs.LG · October 22, 2024

TL;DR

TabSeq introduces a novel feature ordering framework using clustering and attention mechanisms to enhance deep learning performance on heterogeneous tabular data.

Contribution

The paper presents a new feature ordering technique based on clustering and attention, improving deep learning on tabular data by reducing redundancy and emphasizing important features.

Findings

1. Improved deep learning performance on biomedical datasets.
2. Effective feature organization enhances model learning capacity.
3. Feature ordering reduces data redundancy and highlights key features.

Abstract

Effective analysis of tabular data still poses a significant problem in deep learning, mainly because features in tabular datasets are often heterogeneous and have different levels of relevance. This work introduces TabSeq, a novel framework for the sequential ordering of features, addressing the vital necessity to optimize the learning process. Features are not always equally informative, and for certain deep learning models, their random arrangement can hinder the model's learning capacity. Finding the optimum sequence order for such features could improve the deep learning models' learning process. The novel feature ordering technique we provide in this work is based on clustering and incorporates both local ordering and global ordering. It is designed to be used with a multi-head attention mechanism in a denoising autoencoder network. Our framework uses clustering to align…
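The clustering-based ordering described in the abstract, with a local order inside each cluster and a global order across clusters, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `kmeans_features` and `order_features` are hypothetical, plain k-means over raw feature columns stands in for whatever clustering the paper uses, and ranking by variance is an assumed ordering criterion chosen only to make the two levels of ordering concrete.

```python
from statistics import pvariance


def kmeans_features(columns, k, iters=20):
    """Cluster feature columns with plain k-means (illustrative assumption).

    Deterministic init: the first k columns serve as the starting centroids.
    Returns one cluster label per feature column.
    """
    centroids = [list(c) for c in columns[:k]]
    labels = None
    for _ in range(iters):
        # Assignment step: each feature goes to its nearest centroid.
        new_labels = []
        for col in columns:
            dists = [sum((a - b) ** 2 for a, b in zip(col, cen))
                     for cen in centroids]
            new_labels.append(dists.index(min(dists)))
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [columns[i] for i, lab in enumerate(labels) if lab == j]
            if members:
                centroids[j] = [sum(v) / len(members) for v in zip(*members)]
    return labels


def order_features(rows, k=2):
    """Return a feature permutation: global order of clusters, local order within."""
    columns = [list(c) for c in zip(*rows)]
    labels = kmeans_features(columns, k)
    variance = [pvariance(c) for c in columns]  # assumed ranking criterion

    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)

    # Local ordering: inside each cluster, highest-variance feature first.
    groups = [sorted(m, key=lambda i: -variance[i]) for m in clusters.values()]
    # Global ordering: clusters sorted by their mean variance, highest first.
    groups.sort(key=lambda m: -sum(variance[i] for i in m) / len(m))
    return [i for group in groups for i in group]


rows = [
    [0.0, 10.0, 0.1, 12.0],
    [1.0, 20.0, 0.2, 18.0],
    [0.0, 10.0, 0.1, 11.0],
    [1.0, 20.0, 0.2, 19.0],
]
print(order_features(rows, k=2))  # [1, 3, 0, 2]
```

In this toy table, features 1 and 3 fall into one cluster and features 0 and 2 into the other; the reordered columns would then be fed to the downstream model as a sequence. The choice of criterion and clustering method is where a real implementation would differ.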

Code & Models

Repositories

zadid6pretam/TabSeq · tf · Official

Taxonomy

Topics: Advanced Database Systems and Queries

Methods: Attention Is All You Need · Linear Layer · Softmax · Multi-Head Attention · ALIGN · Denoising Autoencoder