Identifying the Source of Generation for Large Language Models

Bumjin Park; Jaesik Choi

arXiv:2407.12846·cs.CL·July 19, 2024

Identifying the Source of Generation for Large Language Models

Bumjin Park, Jaesik Choi

PDF

Open Access

TL;DR

This paper introduces a method for identifying the original source of tokens generated by large language models, enhancing transparency and reliability in AI-generated content.

Contribution

It proposes a novel token-level source identification technique using a bi-gram MLP, enabling source tracing for LLM outputs.

Findings

01

Token-level source identifiers are feasible for tracing document origins.

02

The method generalizes well across different datasets and models.

03

Source identification improves transparency in LLM-generated content.

Abstract

Large language models (LLMs) memorize text from several sources of documents. In pretraining, LLM trains to maximize the likelihood of text but neither receives the source of the text nor memorizes the source. Accordingly, LLM can not provide document information on the generated content, and users do not obtain any hint of reliability, which is crucial for factuality or privacy infringement. This work introduces token-level source identification in the decoding step, which maps the token representation to the reference document. We propose a bi-gram source identifier, a multi-layer perceptron with two successive token representations as input for better generalization. We conduct extensive experiments on Wikipedia and PG19 datasets with several LLMs, layer locations, and identifier sizes. The overall results show a possibility of token-level source identifiers for tracing the document,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsHierarchical Information Threading