Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings
Andrea Posada, Daniel Rueckert, Felix Meissen, Philip M\"uller

TL;DR
This study evaluates 53 language models for medical text classification and generation, highlighting their potential in resource-limited medical settings and emphasizing the need for further exploration of their applications.
Contribution
It provides a comprehensive survey and evaluation of diverse medical language models, focusing on resource-constrained environments and zero-shot learning approaches.
Findings
Remarkable performance of certain models in medical tasks
Models contain significant medical knowledge without domain-specific training
Zero-shot approaches are effective in resource-limited settings
Abstract
Since the Transformer architecture emerged, language model development has grown, driven by their promising potential. Releasing these models into production requires properly understanding their behavior, particularly in sensitive domains like medicine. Despite this need, the medical literature still lacks practical assessment of pre-trained language models, which are especially valuable in settings where only consumer-grade computational resources are available. To address this gap, we have conducted a comprehensive survey of language models in the medical field and evaluated a subset of these for medical text classification and conditional text generation. The subset includes 53 models with 110 million to 13 billion parameters, spanning the Transformer-based model families and knowledge domains. Different approaches are employed for text classification, including zero-shot learning,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsAttention Is All You Need · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Adam · Linear Layer · Absolute Position Encodings
