CERET: Cost-Effective Extrinsic Refinement for Text Generation

Jason Cai; Hang Su; Monica Sunkara; Igor Shalyminov; Saab Mansour

arXiv:2406.05588·cs.CL·November 5, 2024

CERET: Cost-Effective Extrinsic Refinement for Text Generation

Jason Cai, Hang Su, Monica Sunkara, Igor Shalyminov, Saab Mansour

PDF

Open Access 1 Repo 1 Video

TL;DR

CERET is a cost-effective method for refining text generated by large language models, improving quality and accuracy by leveraging semantic stability and uncertainty measures, with lower computational costs than existing self-improvement techniques.

Contribution

Introduces CERET, a novel refinement approach that enhances LLM output quality efficiently using semantic and uncertainty metrics, outperforming existing methods in accuracy and cost-effectiveness.

Findings

01

CERET improves Rouge-1 by ~1.6% in summarization.

02

CERET increases question answering hit rate by ~3.5%.

03

CERET uses only 9.4% of the latency of LLM Self-rerank.

Abstract

Large Language Models (LLMs) are powerful models for generation tasks, but they may not generate good quality outputs in their first attempt. Apart from model fine-tuning, existing approaches to improve prediction accuracy and quality typically involve LLM self-improvement / self-reflection that incorporate feedback from models themselves. Despite their effectiveness, these methods are hindered by their high computational cost and lack of scalability. In this work, we propose CERET, a method for refining text generations by considering semantic stability, entailment and inter-sample uncertainty measures. Experimental results show that CERET outperforms Self-consistency and Self-rerank baselines consistently under various task setups, by ~1.6% in Rouge-1 for abstractive summarization and ~3.5% in hit rate for question answering. Compared to LLM Self-rerank method, our approach only…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amazon-science/ceret-llm-refine
pytorchOfficial

Videos

CERET: Cost-Effective Extrinsic Refinement for Text Generation· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling