SciFive: a text-to-text transformer model for biomedical literature
Long N. Phan, James T. Anibal, Hieu Tran, Shaurya Chanana, Erol Bahadroglu, Alec Peltekian, Grégoire Altan-Bonnet

TL;DR
SciFive is a domain-specific text-to-text transformer model trained on biomedical data, outperforming existing models on various NLP tasks and highlighting the potential of text-generation methods in biomedical NLP.
Contribution
Introduces SciFive, a pre-trained biomedical T5 model that achieves state-of-the-art results across multiple NLP tasks in the biomedical domain.
Findings
SciFive outperforms BERT, BioBERT, and Base T5 on key biomedical NLP tasks.
Text-generation methods show significant potential for complex biomedical NLP tasks.
Results encourage further exploration of challenging text generation applications.
Abstract
In this report, we introduce SciFive, a domain-specific T5 model that has been pre-trained on large biomedical corpora. Our model outperforms the current SOTA methods (i.e., BERT, BioBERT, Base T5) on tasks in named entity recognition, relation extraction, natural language inference, and question answering. We show that text-generation methods have significant potential in a broad array of biomedical NLP tasks, particularly those requiring longer, more complex outputs. Our results support the exploration of more difficult text generation tasks and the development of new methods in this area.
Code & Models
- 🤗 razent/SciFive-base-PMC · model · 112 downloads
- 🤗 razent/SciFive-base-Pubmed · model · 265 downloads · 4 likes
- 🤗 razent/SciFive-base-Pubmed_PMC · model · 4.4k downloads · 7 likes
- 🤗 razent/SciFive-large-PMC · model · 6 downloads · 1 like
- 🤗 razent/SciFive-large-Pubmed · model · 57 downloads · 2 likes
- 🤗 razent/SciFive-large-Pubmed_PMC · model · 187 downloads · 10 likes
- 🤗 razent/SciFive-large-Pubmed_PMC-MedNLI · model · 86 downloads · 2 likes
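The checkpoints listed above follow a regular naming pattern (size, then pre-training corpus), so one could load any of them along the following lines. This is only a minimal sketch, not the authors' code: it assumes the Hugging Face `transformers` library (with `sentencepiece` and PyTorch installed) and network access, and the helper names `scifive_repo_id`, `load_scifive`, and `generate` are hypothetical. The input formatting each task expects is not described here, so the sketch passes text through unchanged.

```python
# Checkpoint components as they appear in the list above.
SIZES = ("base", "large")
CORPORA = ("PMC", "Pubmed", "Pubmed_PMC")


def scifive_repo_id(size: str = "base", corpus: str = "Pubmed_PMC") -> str:
    """Build a Hub repo id such as 'razent/SciFive-base-Pubmed_PMC' (hypothetical helper)."""
    if size not in SIZES:
        raise ValueError(f"size must be one of {SIZES}, got {size!r}")
    if corpus not in CORPORA:
        raise ValueError(f"corpus must be one of {CORPORA}, got {corpus!r}")
    return f"razent/SciFive-{size}-{corpus}"


def load_scifive(size: str = "base", corpus: str = "Pubmed_PMC"):
    """Download a checkpoint and return (tokenizer, model).

    Assumes `transformers` is installed; imported lazily so the
    naming helper above works without it.
    """
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    repo_id = scifive_repo_id(size, corpus)
    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(repo_id)
    return tokenizer, model


def generate(tokenizer, model, text: str, max_new_tokens: int = 64) -> str:
    """Run text-to-text generation on a single input string."""
    inputs = tokenizer(text, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

For example, `tokenizer, model = load_scifive("large", "Pubmed_PMC")` would fetch the large checkpoint pre-trained on both corpora. The pre-trained checkpoints generally need task-specific fine-tuning before their generations are useful for a downstream task.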
Taxonomy
Topics: Topic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques
Methods: Gated Linear Unit · Multi-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Adam · Inverse Square Root Schedule · Linear Warmup With Linear Decay · SentencePiece
