Domain-specific or Uncertainty-aware models: Does it really make a   difference for biomedical text classification?

Aman Sinha; Timothee Mickus; Marianne Clausel; Mathieu Constant and; Xavier Coubez

arXiv:2407.12626·cs.CL·July 18, 2024

Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?

Aman Sinha, Timothee Mickus, Marianne Clausel, Mathieu Constant and, Xavier Coubez

PDF

Open Access 1 Video

TL;DR

This paper investigates whether domain-specific and uncertainty-aware models significantly improve biomedical text classification, emphasizing the importance of task context over model specialization.

Contribution

It analyzes the interplay between domain-specificity and uncertainty awareness in models, highlighting the dominant influence of task requirements.

Findings

01

Domain-specific and uncertainty-aware models can be combined effectively.

02

Task context heavily influences model performance.

03

Entropy of output distribution reflects model's uncertainty and domain adaptation.

Abstract

The success of pretrained language models (PLMs) across a spate of use-cases has led to significant investment from the NLP community towards building domain-specific foundational models. On the other hand, in mission critical settings such as biomedical applications, other aspects also factor in-chief of which is a model's ability to produce reasonable estimates of its own uncertainty. In the present study, we discuss these two desiderata through the lens of how they shape the entropy of a model's output probability distribution. We find that domain specificity and uncertainty awareness can often be successfully combined, but the exact task at hand weighs in much more strongly.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?· underline

Taxonomy

TopicsBiomedical Text Mining and Ontologies