Enhancing the prediction of disease outcomes using electronic health records and pretrained deep learning models
Zhichao Yang, Weisong Liu, Dan Berlowitz, Hong Yu

TL;DR
This study demonstrates that a pretrained encoder-decoder deep learning model significantly enhances the accuracy of predicting various patient outcomes from electronic health records, outperforming existing models.
Contribution
Introduces a denoising sequence-to-sequence model pretrained on large EHR datasets that improves outcome prediction accuracy over current state-of-the-art models.
Findings
Outperformed pretrained BERT on multiple outcomes
Effective in predicting self-harm and pancreatic cancer
Deep bidirectional autoregressive representations enhance predictions
Abstract
Question: Can an encoder-decoder architecture pretrained on a large dataset of longitudinal electronic health records improves patient outcome predictions? Findings: In this prognostic study of 6.8 million patients, our denoising sequence-to-sequence prediction model of multiple outcomes outperformed state-of-the-art models scuh pretrained BERT on a broad range of patient outcomes, including intentional self-harm and pancreatic cancer. Meaning: Deep bidirectional and autoregressive representation improves patient outcome prediction.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · AI in cancer detection · Artificial Intelligence in Healthcare
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Weight Decay · Dense Connections · Linear Warmup With Linear Decay · Softmax · WordPiece · Layer Normalization · Adam
