Selective Generation for Controllable Language Models
Minjae Lee, Kyungmin Kim, Taesoo Kim, Sangdon Park

TL;DR
This paper introduces two algorithms, SGen^Sup and SGen^Semi, for controllable language generation that use textual entailment to ensure generated content meets correctness criteria with theoretical guarantees, improving trustworthiness.
Contribution
The paper proposes a novel framework for selective language generation based on textual entailment, including supervised and semi-supervised algorithms with formal FDR guarantees.
Findings
SGen^Sup effectively controls false discovery rate with human-annotated data.
SGen^Semi leverages unlabeled data via pseudo-labeling, maintaining FDR guarantees.
Both methods perform well on open and closed source language models.
Abstract
Trustworthiness of generative language models (GLMs) is crucial in their deployment to critical decision making systems. Hence, certified risk control methods such as selective prediction and conformal prediction have been applied to mitigating the hallucination problem in various supervised downstream tasks. However, the lack of appropriate correctness metric hinders applying such principled methods to language generation tasks. In this paper, we circumvent this problem by leveraging the concept of textual entailment to evaluate the correctness of the generated sequence, and propose two selective generation algorithms which control the false discovery rate with respect to the textual entailment relation (FDR-E) with a theoretical guarantee: and . , a direct modification of the selective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
