AMMU : A Survey of Transformer-based Biomedical Pretrained Language   Models

Katikapalli Subramanyam Kalyan; Ajit Rajasekharan; Sivanesan Sangeetha

arXiv:2105.00827·cs.CL·September 3, 2021

AMMU : A Survey of Transformer-based Biomedical Pretrained Language Models

Katikapalli Subramanyam Kalyan, Ajit Rajasekharan, Sivanesan Sangeetha

PDF

TL;DR

This survey comprehensively reviews transformer-based biomedical pretrained language models, covering their foundational concepts, taxonomy, challenges, and future research directions in the biomedical NLP domain.

Contribution

It provides the first detailed taxonomy and analysis of biomedical transformer-based PLMs, summarizing core concepts, models, challenges, and open issues.

Findings

01

Various transformer-based BPLMs have been developed for biomedical NLP.

02

Pretraining methods and tasks vary across models, impacting performance.

03

Open issues include data scarcity and model interpretability.

Abstract

Transformer-based pretrained language models (PLMs) have started a new era in modern natural language processing (NLP). These models combine the power of transformers, transfer learning, and self-supervised learning (SSL). Following the success of these models in the general domain, the biomedical research community has developed various in-domain PLMs starting from BioBERT to the latest BioELECTRA and BioALBERT models. We strongly believe there is a need for a survey paper that can provide a comprehensive survey of various transformer-based biomedical pretrained language models (BPLMs). In this survey, we start with a brief overview of foundational concepts like self-supervised learning, embedding layer and transformer encoder layers. We discuss core concepts of transformer-based PLMs like pretraining methods, pretraining tasks, fine-tuning methods, and various embedding types specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.