SciBERT: A Pretrained Language Model for Scientific Text
Iz Beltagy, Kyle Lo, Arman Cohan

TL;DR
SciBERT is a specialized pretrained language model based on BERT, designed to improve NLP performance on scientific texts by leveraging large-scale scientific corpora for unsupervised pretraining.
Contribution
It introduces SciBERT, a domain-specific language model trained on scientific literature, achieving superior results on scientific NLP tasks compared to general models like BERT.
Findings
SciBERT outperforms BERT on multiple scientific NLP tasks.
Achieves state-of-the-art results on several scientific datasets.
Demonstrates the effectiveness of domain-specific pretraining.
Abstract
Obtaining large-scale annotated data for NLP tasks in the scientific domain is challenging and expensive. We release SciBERT, a pretrained language model based on BERT (Devlin et al., 2018) to address the lack of high-quality, large-scale labeled scientific data. SciBERT leverages unsupervised pretraining on a large multi-domain corpus of scientific publications to improve performance on downstream scientific NLP tasks. We evaluate on a suite of tasks including sequence tagging, sentence classification and dependency parsing, with datasets from a variety of scientific domains. We demonstrate statistically significant improvements over BERT and achieve new state-of-the-art results on several of these tasks. The code and pretrained models are available at https://github.com/allenai/scibert/.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗fran-martinez/scibert_scivocab_cased_ner_jnlpbamodel· 64 dl64 dl
- 🤗lordtt13/COVID-SciBERTmodel· 7 dl· ♡ 27 dl♡ 2
- 🤗casehold/bert-doublemodel· 8 dl· ♡ 28 dl♡ 2
- 🤗tsantos/PathologyBERTmodel· 831 dl· ♡ 7831 dl♡ 7
- 🤗sschet/scibert_scivocab_cased_ner_jnlpbamodel· 3 dl3 dl
- 🤗agsc/scibert-bf16-acceptancemodel· 3 dl· ♡ 13 dl♡ 1
- 🤗simon-clmtd/sdg-scibert-zo_upmodel· 2 dl2 dl
- 🤗asjc-classification/scibert_multilabel_asjc_classifiermodel· 21 dl21 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax
