LargePiG: Your Large Language Model is Secretly a Pointer Generator

Zhongxiang Sun; Zihua Si; Xiaoxue Zang; Kai Zheng; Yang Song; Xiao; Zhang; Jun Xu

arXiv:2410.11366·cs.CL·October 16, 2024

LargePiG: Your Large Language Model is Secretly a Pointer Generator

Zhongxiang Sun, Zihua Si, Xiaoxue Zang, Kai Zheng, Yang Song, Xiao, Zhang, Jun Xu

PDF

Open Access

TL;DR

This paper introduces LargePiG, a novel method that transforms large language models into pointer-generators to reduce hallucinations in query generation, improving factual accuracy and relevance.

Contribution

The paper presents a training-free, model-agnostic approach to separate content from form in LLM-generated queries, leveraging inherent attention weights to mitigate hallucinations.

Findings

01

LargePiG outperforms existing methods on document and video datasets.

02

LargePiG reduces hallucinations in vision-language models.

03

Improves factuality and accuracy in question-answering tasks.

Abstract

Recent research on query generation has focused on using Large Language Models (LLMs), which despite bringing state-of-the-art performance, also introduce issues with hallucinations in the generated queries. In this work, we introduce relevance hallucination and factuality hallucination as a new typology for hallucination problems brought by query generation based on LLMs. We propose an effective way to separate content from form in LLM-generated queries, which preserves the factual knowledge extracted and integrated from the inputs and compiles the syntactic structure, including function words, using the powerful linguistic capabilities of the LLM. Specifically, we introduce a model-agnostic and training-free method that turns the Large Language Model into a Pointer-Generator (LargePiG), where the pointer attention distribution leverages the LLM's inherent attention weights, and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsSoftmax · Attention Is All You Need