MedicalBERT: enhancing biomedical natural language processing using pretrained BERT-based model

K. Sahit Reddy; N. Ragavenderan; Vasanth K.; Ganesh N. Naik; Vishalakshi Prabhu; Nagaraja G. S

arXiv:2507.08013·cs.CL·July 28, 2025

MedicalBERT: enhancing biomedical natural language processing using pretrained BERT-based model

K. Sahit Reddy, N. Ragavenderan, Vasanth K., Ganesh N. Naik, Vishalakshi Prabhu, Nagaraja G. S

PDF

TL;DR

MedicalBERT is a domain-specific pretrained BERT model tailored for biomedical NLP tasks, significantly outperforming existing models like BioBERT and SciBERT across various benchmarks by leveraging large biomedical datasets and specialized vocabulary.

Contribution

This paper introduces MedicalBERT, a pretrained BERT model optimized for biomedical NLP, demonstrating superior performance on multiple biomedical text understanding tasks.

Findings

01

MedicalBERT outperforms BioBERT, SciBERT, and ClinicalBERT on most benchmarks.

02

MedicalBERT achieves an average improvement of 5.67% over general BERT across evaluated tasks.

03

Fine-tuning MedicalBERT enhances its effectiveness in diverse biomedical NLP applications.

Abstract

Recent advances in natural language processing (NLP) have been driven bypretrained language models like BERT, RoBERTa, T5, and GPT. Thesemodels excel at understanding complex texts, but biomedical literature, withits domain-specific terminology, poses challenges that models likeWord2Vec and bidirectional long short-term memory (Bi-LSTM) can't fullyaddress. GPT and T5, despite capturing context, fall short in tasks needingbidirectional understanding, unlike BERT. Addressing this, we proposedMedicalBERT, a pretrained BERT model trained on a large biomedicaldataset and equipped with domain-specific vocabulary that enhances thecomprehension of biomedical terminology. MedicalBERT model is furtheroptimized and fine-tuned to address diverse tasks, including named entityrecognition, relation extraction, question answering, sentence similarity, anddocument classification. Performance metrics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.