Med-BERT: pre-trained contextualized embeddings on large-scale   structured electronic health records for disease prediction

Laila Rasmy; Yang Xiang; Ziqian Xie; Cui Tao; Degui Zhi

arXiv:2005.12833·cs.CL·May 27, 2020·61 cites

Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction

Laila Rasmy, Yang Xiang, Ziqian Xie, Cui Tao, Degui Zhi

PDF

Open Access 1 Repo

TL;DR

Med-BERT adapts the BERT framework to structured electronic health records, significantly improving disease prediction accuracy, especially in small datasets, and facilitating AI integration in healthcare.

Contribution

This paper introduces Med-BERT, a novel pre-trained model for structured EHR data, enhancing disease prediction performance and enabling effective use of small datasets.

Findings

01

Boosts AUC by 2.02-7.12% in disease prediction tasks.

02

Significantly improves performance with small training sets (300-500 samples).

03

Reduces data collection costs and accelerates AI adoption in healthcare.

Abstract

Deep learning (DL) based predictive models from electronic health records (EHR) deliver impressive performance in many clinical tasks. Large training cohorts, however, are often required to achieve high accuracy, hindering the adoption of DL-based models in scenarios with limited training data size. Recently, bidirectional encoder representations from transformers (BERT) and related models have achieved tremendous successes in the natural language processing domain. The pre-training of BERT on a very large training corpus generates contextualized embeddings that can boost the performance of models trained on smaller datasets. We propose Med-BERT, which adapts the BERT framework for pre-training contextualized embedding models on structured diagnosis data from 28,490,650 patients EHR dataset. Fine-tuning experiments are conducted on two disease-prediction tasks: (1) prediction of heart…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ZhiGroup/Med-BERT
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare · Artificial Intelligence in Healthcare and Education

MethodsLinear Layer · Weight Decay · Softmax · Adam · Multi-Head Attention · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Linear Warmup With Linear Decay · Dense Connections