Hallucination Augmented Recitations for Language Models

Abdullatif K\"oksal; Renat Aksitov; Chung-Ching Chang

arXiv:2311.07424·cs.CL·November 14, 2023·1 cites

Hallucination Augmented Recitations for Language Models

Abdullatif K\"oksal, Renat Aksitov, Chung-Ching Chang

PDF

Open Access

TL;DR

This paper introduces Hallucination Augmented Recitations (HAR), a novel method for creating counterfactual datasets using hallucination in LLMs to enhance attribution and factual grounding in open book QA tasks.

Contribution

The paper proposes HAR, a new approach to generate counterfactual datasets with hallucination, leading to improved attribution and QA performance over factual datasets.

Findings

01

Up to 8.0% increase in F1 score on open book QA.

02

Counterfactual datasets outperform factual datasets even with smaller size.

03

Improvements are consistent across various datasets and model sizes.

Abstract

Attribution is a key concept in large language models (LLMs) as it enables control over information sources and enhances the factuality of LLMs. While existing approaches utilize open book question answering to improve attribution, factual datasets may reward language models to recall facts that they already know from their pretraining data, not attribution. In contrast, counterfactual open book QA datasets would further improve attribution because the answer could only be grounded in the given text. We propose Hallucination Augmented Recitations (HAR) for creating counterfactual datasets by utilizing hallucination in LLMs to improve attribution. For open book QA as a case study, we demonstrate that models finetuned with our counterfactual datasets improve text grounding, leading to better open book QA performance, with up to an 8.0% increase in F1 score. Our counterfactual dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning in Healthcare · Artificial Intelligence in Healthcare and Education