Towards Verifiable Generation: A Benchmark for Knowledge-aware Language   Model Attribution

Xinze Li; Yixin Cao; Liangming Pan; Yubo Ma; Aixin Sun

arXiv:2310.05634·cs.CL·May 24, 2024·22 cites

Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution

Xinze Li, Yixin Cao, Liangming Pan, Yubo Ma, Aixin Sun

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces KaLMA, a new benchmark and task for attributing language model outputs to structured knowledge from Knowledge Graphs, addressing hallucinations and improving attribution reliability.

Contribution

It extends attribution from unstructured texts to Knowledge Graphs, proposes a 'Conscious Incompetence' setting for incomplete knowledge, and develops a comprehensive evaluation metric and dataset.

Findings

01

Baseline models show room for improvement in citation accuracy.

02

The 'Conscious Incompetence' setting highlights the importance of retrieval quality.

03

The new benchmark facilitates better evaluation of knowledge-aware attribution.

Abstract

Although achieving great success, Large Language Models (LLMs) usually suffer from unreliable hallucinations. Although language attribution can be a potential solution, there are no suitable benchmarks and evaluation metrics to attribute LLMs to structured knowledge. In this paper, we define a new task of Knowledge-aware Language Model Attribution (KaLMA) that improves upon three core concerns with conventional attributed LMs. First, we extend attribution source from unstructured texts to Knowledge Graph (KG), whose rich structures benefit both the attribution performance and working scenarios. Second, we propose a new ``Conscious Incompetence" setting considering the incomplete knowledge repository, where the model identifies the need for supporting knowledge beyond the provided KG. Third, we propose a comprehensive automatic evaluation metric encompassing text quality, citation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lixinze777/knowledge-aware-language-model-attribution
noneOfficial

Videos

Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies