TracLLM: A Generic Framework for Attributing Long Context LLMs

Yanting Wang; Wei Zou; Runpeng Geng; Jinyuan Jia

arXiv:2506.04202 · cs.CR · June 27, 2025

TL;DR

TracLLM is a generic, efficient framework that identifies which texts in a long context contribute most to an LLM's output, supporting interpretability and the debugging of LLM-based systems.

Contribution

TracLLM is the first framework tailored to long context LLMs; it improves the effectiveness and efficiency of feature attribution through informed search and contribution score ensembling.

Findings

01 Effective identification of influential texts in long contexts

02 Improved attribution accuracy with contribution score ensemble

03 Enhanced computational efficiency over existing methods

Abstract

Long context large language models (LLMs) are deployed in many real-world applications such as RAG, agents, and broad LLM-integrated applications. Given an instruction and a long context (e.g., documents, PDF files, webpages), a long context LLM can generate an output grounded in the provided context, aiming to provide more accurate, up-to-date, and verifiable outputs while reducing hallucinations and unsupported claims. This raises a research question: how can we pinpoint the texts (e.g., sentences, passages, or paragraphs) in the context that contribute most to, or are responsible for, the output generated by an LLM? This process, which we call context traceback, has various real-world applications, such as 1) debugging LLM-based systems, 2) conducting post-attack forensic analysis for attacks on an LLM (e.g., prompt injection attacks, knowledge corruption attacks), and 3) highlighting…
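To make the context traceback task concrete, here is a minimal sketch of a generic leave-one-out attribution baseline. It is not TracLLM's algorithm, whose details are not given on this page. The names `leave_one_out_scores`, `top_k_texts`, and `toy_score` are hypothetical, and `toy_score` is a stand-in for a real scoring function, such as how strongly an LLM supports a fixed output given the instruction and the remaining context.

```python
from typing import Callable, List, Tuple


def leave_one_out_scores(
    texts: List[str], score_fn: Callable[[List[str]], float]
) -> List[float]:
    """Contribution of each text = drop in score when that text is removed."""
    full_score = score_fn(texts)
    return [
        full_score - score_fn(texts[:i] + texts[i + 1:])
        for i in range(len(texts))
    ]


def top_k_texts(
    texts: List[str], score_fn: Callable[[List[str]], float], k: int = 1
) -> List[Tuple[int, float]]:
    """Return (index, contribution) pairs for the k highest-scoring texts."""
    scores = leave_one_out_scores(texts, score_fn)
    ranked = sorted(enumerate(scores), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


def toy_score(context: List[str]) -> float:
    # Hypothetical stand-in for an LLM: the output "Paris" is treated as
    # supported (score 1.0) only if some text in the context mentions it.
    return 1.0 if any("Paris" in text for text in context) else 0.0


context = [
    "The Eiffel Tower was completed in 1889.",
    "The capital of France is Paris.",
    "Bananas are rich in potassium.",
]

print(top_k_texts(context, toy_score, k=1))
```

Leave-one-out needs one scoring call per text, so its cost grows linearly with the number of texts in the context. That cost is one reason efficiency matters for attribution over long contexts.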

Code & Models

Repositories

wang-yanting/tracllm
PyTorch · Official

Taxonomy

Topics: Topic Modeling · Advanced Graph Neural Networks · Adversarial Robustness in Machine Learning

Methods: Linear Layer · Attention Dropout · Softmax · WordPiece · Weight Decay · Multi-Head Attention · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout