Selective Generation for Controllable Language Models

Minjae Lee; Kyungmin Kim; Taesoo Kim; Sangdon Park

arXiv:2307.09254·cs.LG·January 28, 2025

Selective Generation for Controllable Language Models

Minjae Lee, Kyungmin Kim, Taesoo Kim, Sangdon Park

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces two algorithms, SGen^Sup and SGen^Semi, for controllable language generation that use textual entailment to ensure generated content meets correctness criteria with theoretical guarantees, improving trustworthiness.

Contribution

The paper proposes a novel framework for selective language generation based on textual entailment, including supervised and semi-supervised algorithms with formal FDR guarantees.

Findings

01

SGen^Sup effectively controls false discovery rate with human-annotated data.

02

SGen^Semi leverages unlabeled data via pseudo-labeling, maintaining FDR guarantees.

03

Both methods perform well on open and closed source language models.

Abstract

Trustworthiness of generative language models (GLMs) is crucial in their deployment to critical decision making systems. Hence, certified risk control methods such as selective prediction and conformal prediction have been applied to mitigating the hallucination problem in various supervised downstream tasks. However, the lack of appropriate correctness metric hinders applying such principled methods to language generation tasks. In this paper, we circumvent this problem by leveraging the concept of textual entailment to evaluate the correctness of the generated sequence, and propose two selective generation algorithms which control the false discovery rate with respect to the textual entailment relation (FDR-E) with a theoretical guarantee: $SGen^{Sup}$ and $SGen^{Semi}$ . $SGen^{Sup}$ , a direct modification of the selective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ml-postech/selective-generation
pytorchOfficial

Videos

Selective Generation for Controllable Language Models· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis