CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs
Shuang Ao, Stefan Rueger, Advaith Siddharthan

TL;DR
This paper introduces a novel CLIP-based contrastive semantic similarity method to improve uncertainty quantification in large language models, leading to more reliable response filtering in question-answering tasks.
Contribution
It proposes a new CLIP-based feature extraction approach for better uncertainty estimation in LLMs, surpassing traditional NLI-based methods.
Findings
Outperforms baseline methods in estimating LLM response reliability
Effective in filtering unreliable LLM generations in QA tasks
Demonstrates robustness across multiple LLMs and datasets
Abstract
Despite the impressive capability of large language models (LLMs), knowing when to trust their generations remains an open challenge. The recent literature on uncertainty quantification of natural language generation (NLG) utilises a conventional natural language inference (NLI) classifier to measure the semantic dispersion of LLMs responses. These studies employ logits of NLI classifier for semantic clustering to estimate uncertainty. However, logits represent the probability of the predicted class and barely contain feature information for potential clustering. Alternatively, CLIP (Contrastive Language-Image Pre-training) performs impressively in extracting image-text pair features and measuring their similarity. To extend its usability, we propose Contrastive Semantic Similarity, the CLIP-based feature extraction module to obtain similarity features for measuring uncertainty for text…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies
MethodsContrastive Language-Image Pre-training
