Early Prediction of Type 2 Diabetes Using Multimodal data and Tabular Transformers
Sulaiman Khan, Md. Rafiul Biswas, Zubair Shah

TL;DR
This paper presents a novel tabular transformer model for early prediction of Type 2 Diabetes using multimodal longitudinal health data, outperforming traditional methods and AI models in a Qatari cohort.
Contribution
Introduces TabTrans, a new transformer-based architecture for analyzing complex tabular healthcare data for early T2DM prediction, integrating multimodal data sources.
Findings
TabTrans achieved ROC AUC ≥ 79.7% in T2DM prediction.
Key predictors include VAT, BMD, BMC, and bone scores.
Model outperformed conventional ML and generative AI models.
Abstract
This study introduces a novel approach for early Type 2 Diabetes Mellitus (T2DM) risk prediction using a tabular transformer (TabTrans) architecture to analyze longitudinal patient data. By processing patients` longitudinal health records and bone-related tabular data, our model captures complex, long-range dependencies in disease progression that conventional methods often overlook. We validated our TabTrans model on a retrospective Qatar BioBank (QBB) cohort of 1,382 subjects, comprising 725 men (146 diabetic, 579 healthy) and 657 women (133 diabetic, 524 healthy). The study integrated electronic health records (EHR) with dual-energy X-ray absorptiometry (DXA) data. To address class imbalance, we employed SMOTE and SMOTE-ENN resampling techniques. The proposed model`s performance is evaluated against conventional machine learning (ML) and generative AI models, including Claude 3.5…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare · Bone health and osteoporosis research
