CaLM: Contrasting Large and Small Language Models to Verify Grounded   Generation

I-Hung Hsu; Zifeng Wang; Long T. Le; Lesly Miculicich; Nanyun Peng,; Chen-Yu Lee; Tomas Pfister

arXiv:2406.05365·cs.CL·June 25, 2024

CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation

I-Hung Hsu, Zifeng Wang, Long T. Le, Lesly Miculicich, Nanyun Peng,, Chen-Yu Lee, Tomas Pfister

PDF

Open Access 1 Video

TL;DR

CaLM introduces a verification framework that uses contrasting large and small language models to improve grounded generation by ensuring responses align with cited sources, enhancing credibility without fine-tuning.

Contribution

The paper proposes CaLM, a novel framework that leverages contrasting small and large language models to verify and refine grounded responses based on cited sources.

Findings

01

Achieves 1.5% to 7% improvement in QA accuracy

02

Does not require model fine-tuning

03

Effective across multiple open-domain datasets

Abstract

Grounded generation aims to equip language models (LMs) with the ability to produce more credible and accountable responses by accurately citing verifiable sources. However, existing methods, by either feeding LMs with raw or preprocessed materials, remain prone to errors. To address this, we introduce CaLM, a novel verification framework. CaLM leverages the insight that a robust grounded response should be consistent with information derived solely from its cited sources. Our framework empowers smaller LMs, which rely less on parametric memory and excel at processing relevant information given a query, to validate the output of larger LMs. Larger LM responses that closely align with the smaller LMs' output, which relies exclusively on cited documents, are verified. Responses showing discrepancies are iteratively refined through a feedback loop. Experiments on three open-domain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation· underline

Taxonomy

TopicsComputational and Text Analysis Methods · Topic Modeling · Big Data Technologies and Applications

MethodsALIGN