Publicly Available Clinical BERT Embeddings

Emily Alsentzer; John R. Murphy; Willie Boag; Wei-Hung Weng; Di Jin,; Tristan Naumann; Matthew B. A. McDermott

arXiv:1904.03323·cs.CL·June 24, 2019·725 cites

Publicly Available Clinical BERT Embeddings

Emily Alsentzer, John R. Murphy, Willie Boag, Wei-Hung Weng, Di Jin,, Tristan Naumann, Matthew B. A. McDermott

PDF

Open Access 3 Repos 8 Models

TL;DR

This paper introduces and releases publicly available BERT embeddings trained specifically on clinical text, demonstrating improved performance on several clinical NLP tasks, with some limitations on de-identification tasks.

Contribution

The paper provides the first publicly available clinical BERT models for generic and discharge summaries, enhancing NLP performance in the clinical domain.

Findings

01

Domain-specific models improve clinical NLP task performance

02

Models are less effective on de-identification tasks

03

Releasing these models supports further clinical NLP research

Abstract

Contextual word embedding models such as ELMo (Peters et al., 2018) and BERT (Devlin et al., 2018) have dramatically improved performance for many natural language processing (NLP) tasks in recent months. However, these models have been minimally explored on specialty corpora, such as clinical text; moreover, in the clinical domain, no publicly-available pre-trained BERT models yet exist. In this work, we address this need by exploring and releasing BERT models for clinical text: one for generic clinical text and another for discharge summaries specifically. We demonstrate that using a domain-specific model yields performance improvements on three common clinical NLP tasks as compared to nonspecific embeddings. These domain-specific models are not as performant on two clinical de-identification tasks, and argue that this is a natural consequence of the differences between de-identified…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsLinear Layer · Sigmoid Activation · Tanh Activation · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam