Jointly Generating and Attributing Answers using Logits of Document-Identifier Tokens

Lucas Albarede; Jose Moreno; Lynda Tamine; Luce Lefeuvre

arXiv:2508.08942·cs.CL·August 13, 2025

Jointly Generating and Attributing Answers using Logits of Document-Identifier Tokens

Lucas Albarede, Jose Moreno, Lynda Tamine, Luce Lefeuvre

PDF

Open Access

TL;DR

LoDIT is a novel method that jointly generates answers and attributes their sources in RAG models by leveraging token logits, improving trustworthiness, efficiency, and robustness over existing approaches.

Contribution

The paper introduces LoDIT, a new approach that directly uses token logits to jointly generate answers and attribute them, reducing latency and enhancing faithfulness.

Findings

01

LoDIT outperforms state-of-the-art models on Trust-Align benchmark.

02

LoDIT is more efficient in terms of latency.

03

LoDIT demonstrates robustness across different settings.

Abstract

Despite their impressive performances, Large Language Models (LLMs) remain prone to hallucination, which critically undermines their trustworthiness. While most of the previous work focused on tackling answer and attribution correctness, a recent line of work investigated faithfulness, with a focus on leveraging internal model signals to reflect a model's actual decision-making process while generating the answer. Nevertheless, these methods induce additional latency and have shown limitations in directly aligning token generation with attribution generation. In this paper, we introduce LoDIT, a method that jointly generates and faithfully attributes answers in RAG by leveraging specific token logits during generation. It consists of two steps: (1) marking the documents with specific token identifiers and then leveraging the logits of these tokens to estimate the contribution of each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Misinformation and Its Impacts