Jointly Generating and Attributing Answers using Logits of Document-Identifier Tokens
Lucas Albarede, Jose Moreno, Lynda Tamine, Luce Lefeuvre

TL;DR
LoDIT is a novel method that jointly generates answers and attributes their sources in RAG models by leveraging token logits, improving trustworthiness, efficiency, and robustness over existing approaches.
Contribution
The paper introduces LoDIT, a new approach that directly uses token logits to jointly generate answers and attribute them, reducing latency and enhancing faithfulness.
Findings
LoDIT outperforms state-of-the-art models on Trust-Align benchmark.
LoDIT is more efficient in terms of latency.
LoDIT demonstrates robustness across different settings.
Abstract
Despite their impressive performances, Large Language Models (LLMs) remain prone to hallucination, which critically undermines their trustworthiness. While most of the previous work focused on tackling answer and attribution correctness, a recent line of work investigated faithfulness, with a focus on leveraging internal model signals to reflect a model's actual decision-making process while generating the answer. Nevertheless, these methods induce additional latency and have shown limitations in directly aligning token generation with attribution generation. In this paper, we introduce LoDIT, a method that jointly generates and faithfully attributes answers in RAG by leveraging specific token logits during generation. It consists of two steps: (1) marking the documents with specific token identifiers and then leveraging the logits of these tokens to estimate the contribution of each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Misinformation and Its Impacts
