Publicly Available Clinical BERT Embeddings
Emily Alsentzer, John R. Murphy, Willie Boag, Wei-Hung Weng, Di Jin,, Tristan Naumann, Matthew B. A. McDermott

TL;DR
This paper introduces and releases publicly available BERT embeddings trained specifically on clinical text, demonstrating improved performance on several clinical NLP tasks, with some limitations on de-identification tasks.
Contribution
The paper provides the first publicly available clinical BERT models for generic and discharge summaries, enhancing NLP performance in the clinical domain.
Findings
Domain-specific models improve clinical NLP task performance
Models are less effective on de-identification tasks
Releasing these models supports further clinical NLP research
Abstract
Contextual word embedding models such as ELMo (Peters et al., 2018) and BERT (Devlin et al., 2018) have dramatically improved performance for many natural language processing (NLP) tasks in recent months. However, these models have been minimally explored on specialty corpora, such as clinical text; moreover, in the clinical domain, no publicly-available pre-trained BERT models yet exist. In this work, we address this need by exploring and releasing BERT models for clinical text: one for generic clinical text and another for discharge summaries specifically. We demonstrate that using a domain-specific model yields performance improvements on three common clinical NLP tasks as compared to nonspecific embeddings. These domain-specific models are not as performant on two clinical de-identification tasks, and argue that this is a natural consequence of the differences between de-identified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗emilyalsentzer/Bio_ClinicalBERTmodel· 2.9M dl· ♡ 4252.9M dl♡ 425
- 🤗emilyalsentzer/Bio_Discharge_Summary_BERTmodel· 433k dl· ♡ 38433k dl♡ 38
- 🤗obi/deid_bert_i2b2model· 1.8k dl· ♡ 231.8k dl♡ 23
- 🤗Charangan/MedBERTmodel· 1.8k dl· ♡ 181.8k dl♡ 18
- 🤗arashpcc/Bio_ClinicalBERTmodel· 1 dl· ♡ 11 dl♡ 1
- 🤗AKHIL001/Bio_Clinical_BERTmodel· 1 dl1 dl
- 🤗BryceAMcDaniel/deid_bert_i2b2-onnxmodel· 14 dl14 dl
- 🤗qwerijsodp/Bio_ClinicalBERTmodel· 10 dl10 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsLinear Layer · Sigmoid Activation · Tanh Activation · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam
