Improving Generated and Retrieved Knowledge Combination Through   Zero-shot Generation

Xinkai Du; Quanjie Han; Chao Lv; Yan Liu; Yalin Sun; Hao Shu; Hongbo; Shan; Maosong Sun

arXiv:2412.18800·cs.CL·December 30, 2024

Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation

Xinkai Du, Quanjie Han, Chao Lv, Yan Liu, Yalin Sun, Hao Shu, Hongbo, Shan, Maosong Sun

PDF

Open Access

TL;DR

This paper introduces BRMGR, an unsupervised re-ranking framework that effectively combines retrieved and generated knowledge for open-domain QA, improving performance across multiple datasets.

Contribution

The paper proposes a novel unsupervised re-ranking approach for merging generated and retrieved knowledge, addressing the lack of labels in open-domain QA.

Findings

01

Improved accuracy on NQ and WebQ datasets (+1.7 and +1.6 points)

02

Achieved competitive results on TriviaQA dataset

03

Demonstrated equivalence to bipartite matching loss in knowledge pairing

Abstract

Open-domain Question Answering (QA) has garnered substantial interest by combining the advantages of faithfully retrieved passages and relevant passages generated through Large Language Models (LLMs). However, there is a lack of definitive labels available to pair these sources of knowledge. In order to address this issue, we propose an unsupervised and simple framework called Bi-Reranking for Merging Generated and Retrieved Knowledge (BRMGR), which utilizes re-ranking methods for both retrieved passages and LLM-generated passages. We pair the two types of passages using two separate re-ranking methods and then combine them through greedy matching. We demonstrate that BRMGR is equivalent to employing a bipartite matching loss when assigning each retrieved passage with a corresponding LLM-generated passage. The application of our model yielded experimental results from three datasets,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning