Representation Learning of Lab Values via Masked AutoEncoders
David Restrepo, Chenwei Wu, Yueran Jia, Jaden K. Sun, Jack Gallifant, Catherine G. Bielick, Yugang Jia, Leo A. Celi

TL;DR
This paper introduces Lab-MAE, a transformer-based masked autoencoder that improves the imputation of missing lab values in EHR data, outperforming existing methods and enhancing fairness across patient groups.
Contribution
We propose Lab-MAE, a novel self-supervised transformer framework that explicitly models temporal dependencies in lab data for improved imputation accuracy.
Findings
Lab-MAE outperforms baselines on MIMIC-IV across RMSE, R2, WD.
It achieves equitable performance across demographic groups.
Lab-MAE demonstrates robustness without follow-up lab data.
Abstract
Accurate imputation of missing laboratory values in electronic health records (EHRs) is critical to enable robust clinical predictions and reduce biases in AI systems in healthcare. Existing methods, such as XGBoost, softimpute, GAIN, Expectation Maximization (EM), and MICE, struggle to model the complex temporal and contextual dependencies in EHR data, particularly in underrepresented groups. In this work, we propose Lab-MAE, a novel transformer-based masked autoencoder framework that leverages self-supervised learning for the imputation of continuous sequential lab values. Lab-MAE introduces a structured encoding scheme that jointly models laboratory test values and their corresponding timestamps, enabling explicit capturing temporal dependencies. Empirical evaluation on the MIMIC-IV dataset demonstrates that Lab-MAE significantly outperforms state-of-the-art baselines such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Fault Detection and Control Systems · Experimental Learning in Engineering
