JoPA:Explaining Large Language Model's Generation via Joint Prompt Attribution

Yurui Chang; Bochuan Cao; Yujia Wang; Jinghui Chen; Lu Lin

arXiv:2405.20404·cs.CL·September 17, 2025

JoPA:Explaining Large Language Model's Generation via Joint Prompt Attribution

Yurui Chang, Bochuan Cao, Yujia Wang, Jinghui Chen, Lu Lin

PDF

Open Access

TL;DR

This paper introduces JoPA, a novel framework for explaining how multiple prompts collaboratively influence large language model outputs, addressing the complexity of prompt interactions in text generation.

Contribution

JoPA formulates prompt attribution as a combinatorial optimization problem and proposes a probabilistic algorithm to identify influential prompt combinations for generation explanation.

Findings

01

JoPA effectively explains prompt influence on generation.

02

The framework demonstrates high faithfulness in explanations.

03

JoPA is efficient in identifying key prompt combinations.

Abstract

Large Language Models (LLMs) have demonstrated impressive performances in complex text generation tasks. However, the contribution of the input prompt to the generated content still remains obscure to humans, underscoring the necessity of understanding the causality between input and output pairs. Existing works for providing prompt-specific explanation often confine model output to be classification or next-word prediction. Few initial attempts aiming to explain the entire language generation often treat input prompt texts independently, ignoring their combinatorial effects on the follow-up generation. In this study, we introduce a counterfactual explanation framework based on Joint Prompt Attribution, JoPA, which aims to explain how a few prompt texts collaboratively influences the LLM's complete generation. Particularly, we formulate the task of prompt attribution for generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)