CSS: Contrastive Semantic Similarity for Uncertainty Quantification of   LLMs

Shuang Ao; Stefan Rueger; Advaith Siddharthan

arXiv:2406.03158·cs.CL·June 6, 2024

CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs

Shuang Ao, Stefan Rueger, Advaith Siddharthan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel CLIP-based contrastive semantic similarity method to improve uncertainty quantification in large language models, leading to more reliable response filtering in question-answering tasks.

Contribution

It proposes a new CLIP-based feature extraction approach for better uncertainty estimation in LLMs, surpassing traditional NLI-based methods.

Findings

01

Outperforms baseline methods in estimating LLM response reliability

02

Effective in filtering unreliable LLM generations in QA tasks

03

Demonstrates robustness across multiple LLMs and datasets

Abstract

Despite the impressive capability of large language models (LLMs), knowing when to trust their generations remains an open challenge. The recent literature on uncertainty quantification of natural language generation (NLG) utilises a conventional natural language inference (NLI) classifier to measure the semantic dispersion of LLMs responses. These studies employ logits of NLI classifier for semantic clustering to estimate uncertainty. However, logits represent the probability of the predicted class and barely contain feature information for potential clustering. Alternatively, CLIP (Contrastive Language-Image Pre-training) performs impressively in extracting image-text pair features and measuring their similarity. To extend its usability, we propose Contrastive Semantic Similarity, the CLIP-based feature extraction module to obtain similarity features for measuring uncertainty for text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aoshuang92/css_uq_llms
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies

MethodsContrastive Language-Image Pre-training