Document Attribution: Examining Citation Relationships using Large Language Models

Vipula Rawte; Ryan A. Rossi; Franck Dernoncourt; Nedim Lipka

arXiv:2505.06324·cs.IR·May 13, 2025

Document Attribution: Examining Citation Relationships using Large Language Models

Vipula Rawte, Ryan A. Rossi, Franck Dernoncourt, Nedim Lipka

PDF

Open Access 1 Video

TL;DR

This paper investigates methods for improving document attribution in large language models, proposing zero-shot entailment techniques and analyzing attention mechanisms to enhance citation reliability and interpretability.

Contribution

It introduces a zero-shot entailment approach for attribution and examines the impact of attention layers on attribution performance in LLMs.

Findings

01

Zero-shot entailment improves attribution accuracy on benchmark datasets.

02

Attention mechanisms influence attribution performance across different layers.

03

Flan-ul2 outperforms baselines in citation correctness metrics.

Abstract

As Large Language Models (LLMs) are increasingly applied to document-based tasks - such as document summarization, question answering, and information extraction - where user requirements focus on retrieving information from provided documents rather than relying on the model's parametric knowledge, ensuring the trustworthiness and interpretability of these systems has become a critical concern. A central approach to addressing this challenge is attribution, which involves tracing the generated outputs back to their source documents. However, since LLMs can produce inaccurate or imprecise responses, it is crucial to assess the reliability of these citations. To tackle this, our work proposes two techniques. (1) A zero-shot approach that frames attribution as a straightforward textual entailment task. Our method using flan-ul2 demonstrates an improvement of 0.27% and 2.4% over the best…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Document Attribution: Examining Citation Relationships using Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Text Readability and Simplification · Advanced Text Analysis Techniques

MethodsSoftmax · Attention Is All You Need · Focus