CUE: An Uncertainty Interpretation Framework for Text Classifiers Built on Pre-Trained Language Models
Jiazheng Li, Zhaoyue Sun, Bin Liang, Lin Gui, Yulan He

TL;DR
The paper introduces CUE, a framework that interprets predictive uncertainty in PLM-based text classifiers by perturbing latent representations and tracing back to input features, enhancing understanding of model reliability.
Contribution
CUE is a novel framework that maps PLM representations to a latent space, perturbs it to analyze uncertainty, and identifies input features responsible for uncertain predictions.
Findings
Effective in identifying uncertainty sources across tasks
Demonstrates applicability on multiple benchmark datasets
Provides insights into model decision-making processes
Abstract
Text classifiers built on Pre-trained Language Models (PLMs) have achieved remarkable progress in various tasks including sentiment analysis, natural language inference, and question-answering. However, the occurrence of uncertain predictions by these classifiers poses a challenge to their reliability when deployed in practical applications. Much effort has been devoted to designing various probes in order to understand what PLMs capture. But few studies have delved into factors influencing PLM-based classifiers' predictive uncertainty. In this paper, we propose a novel framework, called CUE, which aims to interpret uncertainties inherent in the predictions of PLM-based models. In particular, we first map PLM-encoded representations to a latent space via a variational auto-encoder. We then generate text representations by perturbing the latent space which causes fluctuation in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
