Modeling Disagreement in Automatic Data Labelling for Semi-Supervised   Learning in Clinical Natural Language Processing

Hongshu Liu; Nabeel Seedat; Julia Ive

arXiv:2205.14761·cs.LG·June 9, 2022·1 cites

Modeling Disagreement in Automatic Data Labelling for Semi-Supervised Learning in Clinical Natural Language Processing

Hongshu Liu, Nabeel Seedat, Julia Ive

PDF

Open Access

TL;DR

This paper evaluates the uncertainty estimation capabilities of various models in clinical NLP, showing Gaussian Processes outperform others in quantifying risks in radiology report analysis.

Contribution

It introduces a comparative analysis of uncertainty estimation methods in healthcare NLP, highlighting the effectiveness of Gaussian Processes for risk quantification.

Findings

01

Gaussian Processes outperform other models in uncertainty quantification.

02

GPs provide better risk estimates with strong predictive performance.

03

Uncertainty estimation improves decision-making in clinical NLP applications.

Abstract

Computational models providing accurate estimates of their uncertainty are crucial for risk management associated with decision making in healthcare contexts. This is especially true since many state-of-the-art systems are trained using the data which has been labelled automatically (self-supervised mode) and tend to overfit. In this work, we investigate the quality of uncertainty estimates from a range of current state-of-the-art predictive models applied to the problem of observation detection in radiology reports. This problem remains understudied for Natural Language Processing in the healthcare domain. We demonstrate that Gaussian Processes (GPs) provide superior performance in quantifying the risks of 3 uncertainty labels based on the negative log predictive probability (NLPP) evaluation metric and mean maximum predicted confidence levels (MMPCL), whilst retaining strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education