CEHR-BERT: Incorporating temporal information from structured EHR data to improve prediction tasks
Chao Pang (1), Xinzhuo Jiang (1), Krishna S Kalluri (1), Matthew, Spotnitz (1), RuiJun Chen (2), Adler Perotte (1), Karthik Natarajan (1) ((1), Columbia University Irving Medical Center, (2) Geisinger)

TL;DR
CEHR-BERT is a novel BERT-based model that effectively incorporates temporal information from structured EHR data, significantly improving prediction accuracy across multiple clinical tasks.
Contribution
This paper introduces CEHR-BERT, a new adaptation of BERT that integrates temporal data using artificial time tokens and a second learning objective, enhancing clinical prediction models.
Findings
CEHR-BERT outperformed existing models in all prediction tasks.
Model trained on only 5% of data surpassed models trained on full data.
Incremental gains observed with each temporal component added.
Abstract
Embedding algorithms are increasingly used to represent clinical concepts in healthcare for improving machine learning tasks such as clinical phenotyping and disease prediction. Recent studies have adapted state-of-the-art bidirectional encoder representations from transformers (BERT) architecture to structured electronic health records (EHR) data for the generation of contextualized concept embeddings, yet do not fully incorporate temporal data across multiple clinical domains. Therefore we developed a new BERT adaptation, CEHR-BERT, to incorporate temporal information using a hybrid approach by augmenting the input to BERT using artificial time tokens, incorporating time, age, and concept embeddings, and introducing a new second learning objective for visit type. CEHR-BERT was trained on a subset of Columbia University Irving Medical Center-York Presbyterian Hospital's clinical data,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Artificial Intelligence in Healthcare
MethodsAttention Is All You Need · Linear Layer · Dense Connections · Layer Normalization · Multi-Head Attention · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · WordPiece · Dropout · Residual Connection
