Contextualized Sequence Likelihood: Enhanced Confidence Scores for   Natural Language Generation

Zhen Lin; Shubhendu Trivedi; Jimeng Sun

arXiv:2406.01806·cs.CL·June 5, 2024

Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation

Zhen Lin, Shubhendu Trivedi, Jimeng Sun

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Contextualized Sequence Likelihood (CSL), a new confidence scoring method for large language models that improves reliability by weighting tokens based on attention, enhancing natural language generation evaluation.

Contribution

The paper proposes CSL, a novel confidence measure that uses attention weights to better assess generation quality, outperforming existing methods across multiple datasets and models.

Findings

01

CSL significantly outperforms baseline confidence scores in predicting generation quality.

02

CSL is easy to implement and computationally efficient.

03

CSL shows improved reliability across various QA datasets and LLMs.

Abstract

The advent of large language models (LLMs) has dramatically advanced the state-of-the-art in numerous natural language generation tasks. For LLMs to be applied reliably, it is essential to have an accurate measure of their confidence. Currently, the most commonly used confidence score function is the likelihood of the generated sequence, which, however, conflates semantic and syntactic components. For instance, in question-answering (QA) tasks, an awkward phrasing of the correct answer might result in a lower probability prediction. Additionally, different tokens should be weighted differently depending on the context. In this work, we propose enhancing the predicted sequence probability by assigning different weights to various tokens using attention values elicited from the base LLM. By employing a validation set, we can identify the relevant attention heads, thereby significantly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zlin7/contextsl
pytorchOfficial

Videos

Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis

MethodsBalanced Selection · Circular Smooth Label