TL;DR
BioBERT is a domain-specific language model pre-trained on biomedical texts that significantly improves performance on various biomedical text mining tasks compared to general NLP models.
Contribution
This paper introduces BioBERT, a biomedical domain-specific pre-trained language model that outperforms previous models on key biomedical text mining tasks.
Findings
BioBERT outperforms BERT and previous models in biomedical named entity recognition.
BioBERT achieves a 2.80% F1 score improvement in relation extraction.
BioBERT improves biomedical question answering performance by 12.24% MRR.
Abstract
Biomedical text mining is becoming increasingly important as the number of biomedical documents rapidly grows. With the progress in natural language processing (NLP), extracting valuable information from biomedical literature has gained popularity among researchers, and deep learning has boosted the development of effective biomedical text mining models. However, directly applying the advancements in NLP to biomedical text mining often yields unsatisfactory results due to a word distribution shift from general domain corpora to biomedical corpora. In this article, we investigate how the recently introduced pre-trained language model BERT can be adapted for biomedical corpora. We introduce BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining), which is a domain-specific language representation model pre-trained on large-scale biomedical corpora.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗emilyalsentzer/Bio_ClinicalBERTmodel· 2.9M dl· ♡ 4252.9M dl♡ 425
- 🤗dmis-lab/biobert-large-cased-v1.1-squadmodel· 14k dl· ♡ 2114k dl♡ 21
- 🤗dmis-lab/biosyn-sapbert-ncbi-diseasemodel· 18 dl· ♡ 218 dl♡ 2
- 🤗emilyalsentzer/Bio_Discharge_Summary_BERTmodel· 433k dl· ♡ 38433k dl♡ 38
- 🤗tsantos/PathologyBERTmodel· 831 dl· ♡ 7831 dl♡ 7
- 🤗IVN-RIN/bioBITmodel· 283 dl· ♡ 3283 dl♡ 3
- 🤗mocherson/AD-BERTmodel· 286 dl286 dl
- 🤗arashpcc/Bio_ClinicalBERTmodel· 1 dl· ♡ 11 dl♡ 1
- 🤗disi-unibo-nlp/MedGENIE-fid-flan-t5-base-medqamodel· 5 dl5 dl
- 🤗AKHIL001/Bio_Clinical_BERTmodel· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax
