LLM Attributor: Interactive Visual Attribution for LLM Generation

Seongmin Lee; Zijie J. Wang; Aishwarya Chakravarthy; Alec Helbling,; ShengYun Peng; Mansi Phute; Duen Horng Chau; Minsuk Kahng

arXiv:2404.01361·cs.CL·April 3, 2024·1 cites

LLM Attributor: Interactive Visual Attribution for LLM Generation

Seongmin Lee, Zijie J. Wang, Aishwarya Chakravarthy, Alec Helbling,, ShengYun Peng, Mansi Phute, Duen Horng Chau, Minsuk Kahng

PDF

Open Access 2 Repos 1 Video

TL;DR

LLM Attributor is an interactive visualization tool that helps users understand and inspect the training data influences on large language model outputs, improving transparency and trust.

Contribution

The paper introduces LLM Attributor, a novel Python library that provides interactive visualizations for data attribution in LLMs, supporting model inspection and comparison.

Findings

01

Enables visualization of training data influence on LLM outputs

02

Supports integration with computational notebooks for ease of use

03

Demonstrated on LLaMA2 models with diverse datasets

Abstract

While large language models (LLMs) have shown remarkable capability to generate convincing text across diverse domains, concerns around its potential risks have highlighted the importance of understanding the rationale behind text generation. We present LLM Attributor, a Python library that provides interactive visualizations for training data attribution of an LLM's text generation. Our library offers a new way to quickly attribute an LLM's text generation to training data points to inspect model behaviors, enhance its trustworthiness, and compare model-generated text with user-provided text. We describe the visual and interactive design of our tool and highlight usage scenarios for LLaMA2 models fine-tuned with two different datasets: online articles about recent disasters and finance-related question-answer pairs. Thanks to LLM Attributor's broad support for computational notebooks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

LLM Attributor: Interactive Visual Attribution for LLM Generation· underline

Taxonomy

TopicsSemantic Web and Ontologies · Digital Rights Management and Security · Natural Language Processing Techniques

MethodsLib