Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution
Xinze Li, Yixin Cao, Liangming Pan, Yubo Ma, Aixin Sun

TL;DR
This paper introduces KaLMA, a new benchmark and task for attributing language model outputs to structured knowledge from Knowledge Graphs, addressing hallucinations and improving attribution reliability.
Contribution
It extends attribution from unstructured texts to Knowledge Graphs, proposes a 'Conscious Incompetence' setting for incomplete knowledge, and develops a comprehensive evaluation metric and dataset.
Findings
Baseline models show room for improvement in citation accuracy.
The 'Conscious Incompetence' setting highlights the importance of retrieval quality.
The new benchmark facilitates better evaluation of knowledge-aware attribution.
Abstract
Although achieving great success, Large Language Models (LLMs) usually suffer from unreliable hallucinations. Although language attribution can be a potential solution, there are no suitable benchmarks and evaluation metrics to attribute LLMs to structured knowledge. In this paper, we define a new task of Knowledge-aware Language Model Attribution (KaLMA) that improves upon three core concerns with conventional attributed LMs. First, we extend attribution source from unstructured texts to Knowledge Graph (KG), whose rich structures benefit both the attribution performance and working scenarios. Second, we propose a new ``Conscious Incompetence" setting considering the incomplete knowledge repository, where the model identifies the need for supporting knowledge beyond the provided KG. Third, we propose a comprehensive automatic evaluation metric encompassing text quality, citation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
