TracLLM: A Generic Framework for Attributing Long Context LLMs
Yanting Wang, Wei Zou, Runpeng Geng, Jinyuan Jia

TL;DR
TracLLM introduces a generic, efficient framework for attributing the influence of specific texts in long contexts on LLM outputs, enhancing interpretability and debugging capabilities.
Contribution
It is the first framework tailored for long context LLMs that improves feature attribution effectiveness and efficiency through informed search and ensemble techniques.
Findings
Effective identification of influential texts in long contexts
Improved attribution accuracy with contribution score ensemble
Enhanced computational efficiency over existing methods
Abstract
Long context large language models (LLMs) are deployed in many real-world applications such as RAG, agent, and broad LLM-integrated applications. Given an instruction and a long context (e.g., documents, PDF files, webpages), a long context LLM can generate an output grounded in the provided context, aiming to provide more accurate, up-to-date, and verifiable outputs while reducing hallucinations and unsupported claims. This raises a research question: how to pinpoint the texts (e.g., sentences, passages, or paragraphs) in the context that contribute most to or are responsible for the generated output by an LLM? This process, which we call context traceback, has various real-world applications, such as 1) debugging LLM-based systems, 2) conducting post-attack forensic analysis for attacks (e.g., prompt injection attack, knowledge corruption attacks) to an LLM, and 3) highlighting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Adversarial Robustness in Machine Learning
MethodsLinear Layer · Attention Dropout · Softmax · WordPiece · Refunds@Expedia|||How do I get a full refund from Expedia? · Weight Decay · Multi-Head Attention · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout
