Exploring Predictive Uncertainty and Calibration in NLP: A Study on the   Impact of Method & Data Scarcity

Dennis Ulmer; Jes Frellsen; Christian Hardmeier

arXiv:2210.15452·cs.CL·October 28, 2022

Exploring Predictive Uncertainty and Calibration in NLP: A Study on the Impact of Method & Data Scarcity

Dennis Ulmer, Jes Frellsen, Christian Hardmeier

PDF

Open Access 1 Repo

TL;DR

This paper examines how different methods estimate predictive uncertainty in NLP models, especially under data scarcity, revealing that pre-trained models and ensembles perform best but can be affected by data volume, with uncertainties mainly driven by data rather than model factors.

Contribution

It provides a comprehensive evaluation of uncertainty estimation methods in low-resource NLP settings, highlighting the influence of data scarcity and model choice on uncertainty quality.

Findings

01

Pre-trained models and ensembles yield the best uncertainty estimates.

02

More data can sometimes degrade the quality of uncertainty estimates.

03

Model uncertainty is less influential than data uncertainty in total uncertainty.

Abstract

We investigate the problem of determining the predictive confidence (or, conversely, uncertainty) of a neural classifier through the lens of low-resource languages. By training models on sub-sampled datasets in three different languages, we assess the quality of estimates from a wide array of approaches and their dependence on the amount of available data. We find that while approaches based on pre-trained models and ensembles achieve the best results overall, the quality of uncertainty estimates can surprisingly suffer with more data. We also perform a qualitative analysis of uncertainties on sequences, discovering that a model's total uncertainty seems to be influenced to a large degree by its data uncertainty, not model uncertainty. All model implementations are open-sourced in a software package.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kaleidophon/nlp-low-resource-uncertainty
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Natural Language Processing Techniques · Topic Modeling