Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?
Aman Sinha, Timothee Mickus, Marianne Clausel, Mathieu Constant and, Xavier Coubez

TL;DR
This paper investigates whether domain-specific and uncertainty-aware models significantly improve biomedical text classification, emphasizing the importance of task context over model specialization.
Contribution
It analyzes the interplay between domain-specificity and uncertainty awareness in models, highlighting the dominant influence of task requirements.
Findings
Domain-specific and uncertainty-aware models can be combined effectively.
Task context heavily influences model performance.
Entropy of output distribution reflects model's uncertainty and domain adaptation.
Abstract
The success of pretrained language models (PLMs) across a spate of use-cases has led to significant investment from the NLP community towards building domain-specific foundational models. On the other hand, in mission critical settings such as biomedical applications, other aspects also factor in-chief of which is a model's ability to produce reasonable estimates of its own uncertainty. In the present study, we discuss these two desiderata through the lens of how they shape the entropy of a model's output probability distribution. We find that domain specificity and uncertainty awareness can often be successfully combined, but the exact task at hand weighs in much more strongly.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsBiomedical Text Mining and Ontologies
