Lightweight Transformers for Clinical Natural Language Processing
Omid Rohanian, Mohammadmahdi Nouriborji, Hannah Jauncey, Samaneh, Kouchaki, ISARIC Clinical Characterisation Group, Lei Clifton, Laura Merson,, David A. Clifton

TL;DR
This paper introduces a set of lightweight transformer models tailored for clinical NLP tasks, achieving comparable performance to larger models while being more resource-efficient, and demonstrates their effectiveness across multiple clinical text-mining benchmarks.
Contribution
Developed and evaluated compact clinical transformers using knowledge distillation and continual learning, filling a gap in resource-efficient models for clinical NLP.
Findings
Models with 15-65 million parameters perform comparably to larger models.
Outperform other compact models trained on general or biomedical data.
Effective across diverse clinical NLP tasks.
Abstract
Specialised pre-trained language models are becoming more frequent in NLP since they can potentially outperform models trained on generic texts. BioBERT and BioClinicalBERT are two examples of such models that have shown promise in medical NLP tasks. Many of these models are overparametrised and resource-intensive, but thanks to techniques like Knowledge Distillation (KD), it is possible to create smaller versions that perform almost as well as their larger counterparts. In this work, we specifically focus on development of compact language models for processing clinical texts (i.e. progress notes, discharge summaries etc). We developed a number of efficient lightweight clinical transformers using knowledge distillation and continual learning, with the number of parameters ranging from 15 million to 65 million. These models performed comparably to larger models such as BioBERT and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsKnowledge Distillation
