BioBERT: a pre-trained biomedical language representation model for   biomedical text mining

Jinhyuk Lee; Wonjin Yoon; Sungdong Kim; Donghyeon Kim; Sunkyu Kim,; Chan Ho So; Jaewoo Kang

arXiv:1901.08746·cs.CL·October 21, 2019

BioBERT: a pre-trained biomedical language representation model for biomedical text mining

Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim,, Chan Ho So, Jaewoo Kang

PDF

5 Repos 10 Models

TL;DR

BioBERT is a domain-specific language model pre-trained on biomedical texts that significantly improves performance on various biomedical text mining tasks compared to general NLP models.

Contribution

This paper introduces BioBERT, a biomedical domain-specific pre-trained language model that outperforms previous models on key biomedical text mining tasks.

Findings

01

BioBERT outperforms BERT and previous models in biomedical named entity recognition.

02

BioBERT achieves a 2.80% F1 score improvement in relation extraction.

03

BioBERT improves biomedical question answering performance by 12.24% MRR.

Abstract

Biomedical text mining is becoming increasingly important as the number of biomedical documents rapidly grows. With the progress in natural language processing (NLP), extracting valuable information from biomedical literature has gained popularity among researchers, and deep learning has boosted the development of effective biomedical text mining models. However, directly applying the advancements in NLP to biomedical text mining often yields unsatisfactory results due to a word distribution shift from general domain corpora to biomedical corpora. In this article, we investigate how the recently introduced pre-trained language model BERT can be adapted for biomedical corpora. We introduce BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining), which is a domain-specific language representation model pre-trained on large-scale biomedical corpora.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax