LargePiG: Your Large Language Model is Secretly a Pointer Generator
Zhongxiang Sun, Zihua Si, Xiaoxue Zang, Kai Zheng, Yang Song, Xiao Zhang, Jun Xu

TL;DR
This paper introduces LargePiG, a novel method that transforms large language models into pointer-generators to reduce hallucinations in query generation, improving factual accuracy and relevance.
Contribution
The paper presents a training-free, model-agnostic approach to separate content from form in LLM-generated queries, leveraging inherent attention weights to mitigate hallucinations.
Findings
LargePiG outperforms existing methods on document and video datasets.
LargePiG reduces hallucinations in vision-language models.
LargePiG improves factuality and accuracy in question-answering tasks.
Abstract
Recent research on query generation has focused on using Large Language Models (LLMs), which despite bringing state-of-the-art performance, also introduce issues with hallucinations in the generated queries. In this work, we introduce relevance hallucination and factuality hallucination as a new typology for hallucination problems brought by query generation based on LLMs. We propose an effective way to separate content from form in LLM-generated queries, which preserves the factual knowledge extracted and integrated from the inputs and compiles the syntactic structure, including function words, using the powerful linguistic capabilities of the LLM. Specifically, we introduce a model-agnostic and training-free method that turns the Large Language Model into a Pointer-Generator (LargePiG), where the pointer attention distribution leverages the LLM's inherent attention weights, and the…
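The pointer-generator idea named in the abstract can be illustrated with the classic mixture of a vocabulary distribution and a copy distribution built from attention weights over the source tokens. The sketch below is a generic illustration under stated assumptions, not LargePiG's exact procedure: the abstract is truncated before it explains how the two distributions are combined, so the function name `pointer_generator_mix`, the scalar `copy_gate`, and the toy numbers are all hypothetical.

```python
from typing import List


def pointer_generator_mix(
    vocab_probs: List[float],
    attn_weights: List[float],
    source_token_ids: List[int],
    copy_gate: float,
) -> List[float]:
    """Blend a vocabulary distribution with a copy distribution.

    vocab_probs: next-token probabilities over the vocabulary.
    attn_weights: attention mass on each source position (need not be normalized).
    source_token_ids: vocabulary id of the token at each source position.
    copy_gate: hypothetical scalar in [0, 1]; higher values favor copying.
    """
    if not 0.0 <= copy_gate <= 1.0:
        raise ValueError("copy_gate must lie in [0, 1]")
    if len(attn_weights) != len(source_token_ids):
        raise ValueError("need one attention weight per source token")
    total = sum(attn_weights)
    if total <= 0.0:
        raise ValueError("attention weights must have positive mass")

    # Scatter normalized attention onto vocabulary ids; a token that occurs
    # several times in the source accumulates the mass of all its positions.
    copy_probs = [0.0] * len(vocab_probs)
    for weight, token_id in zip(attn_weights, source_token_ids):
        copy_probs[token_id] += weight / total

    # Convex combination of the two distributions, so the result still sums to 1.
    return [
        (1.0 - copy_gate) * p_vocab + copy_gate * p_copy
        for p_vocab, p_copy in zip(vocab_probs, copy_probs)
    ]


# Toy example: 5-token vocabulary, 3 source positions (token 2 appears twice).
mixed = pointer_generator_mix(
    vocab_probs=[0.1, 0.2, 0.3, 0.3, 0.1],
    attn_weights=[0.5, 0.25, 0.25],
    source_token_ids=[2, 4, 2],
    copy_gate=0.5,
)
```

In this toy case, tokens that appear in the source (ids 2 and 4) gain probability relative to the plain vocabulary distribution, while tokens absent from the source are scaled down. That is the mechanism by which a pointer distribution can steer generation toward content grounded in the input.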
Taxonomy
Topics: Topic Modeling
Methods: Softmax · Attention Is All You Need
