Peering into the Mind of Language Models: An Approach for Attribution in   Contextual Question Answering

Anirudh Phukan; Shwetha Somasundaram; Apoorv Saxena; Koustava Goswami,; Balaji Vasan Srinivasan

arXiv:2405.17980·cs.CL·May 29, 2024·1 cites

Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering

Anirudh Phukan, Shwetha Somasundaram, Apoorv Saxena, Koustava Goswami,, Balaji Vasan Srinivasan

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel attribution method for contextual question answering with large language models, leveraging hidden states to identify source segments without retraining, achieving competitive results and broad applicability.

Contribution

The authors propose a new attribution approach using LLM hidden states that does not require retraining, improving source identification in contextual QA tasks.

Findings

01

Performs on par or better than GPT-4 in identifying copied segments.

02

Works effectively across various LLM architectures.

03

Introduces a new dataset with token-level attribution annotations.

Abstract

With the enhancement in the field of generative artificial intelligence (AI), contextual question answering has become extremely relevant. Attributing model generations to the input source document is essential to ensure trustworthiness and reliability. We observe that when large language models (LLMs) are used for contextual question answering, the output answer often consists of text copied verbatim from the input prompt which is linked together with "glue text" generated by the LLM. Motivated by this, we propose that LLMs have an inherent awareness from where the text was copied, likely captured in the hidden states of the LLM. We introduce a novel method for attribution in contextual question answering, leveraging the hidden state representations of LLMs. Our approach bypasses the need for extensive model retraining and retrieval model overhead, offering granular attributions and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Anirudh-Phukan/verifiability-granular
noneOfficial

Videos

Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering· underline

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Semantic Web and Ontologies

MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections