Bi-Axial Transformers: Addressing the Increasing Complexity of EHR Classification
Rachael DeVries, Casper Christensen, Marie Lisandra Zepeda Mendoza, Ole Winther

TL;DR
The paper introduces Bi-Axial Transformer (BAT), a novel model for EHR classification that captures complex data relationships, improves robustness to missing data, and achieves state-of-the-art results in sepsis prediction.
Contribution
We propose the Bi-Axial Transformer, a new architecture that attends to both variable and time axes in EHR data, enhancing modeling of data relationships and handling missingness.
Findings
BAT achieves state-of-the-art sepsis prediction performance.
BAT is more robust to data missingness than other transformers.
Re-implemented baseline models are publicly available for benchmarking.
Abstract
Electronic Health Records (EHRs), the digital representation of a patient's medical history, are a valuable resource for epidemiological and clinical research. They are also becoming increasingly complex, with recent trends indicating larger datasets, longer time series, and multi-modal integrations. Transformers, which have rapidly gained popularity due to their success in natural language processing and other domains, are well-suited to address these challenges due to their ability to model long-range dependencies and process data in parallel. But their application to EHR classification remains limited by data representations, which can reduce performance or fail to capture informative missingness. In this paper, we present the Bi-Axial Transformer (BAT), which attends to both the clinical variable and time point axes of EHR data to learn richer data relationships and address the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsECG Monitoring and Analysis
