Training Data Attribution (TDA): Examining Its Adoption & Use Cases

Deric Cheng; Juhan Bae; Justin Bullock; David Kristofferson

arXiv:2501.12642·cs.CY·January 23, 2025

Training Data Attribution (TDA): Examining Its Adoption & Use Cases

Deric Cheng, Juhan Bae, Justin Bullock, David Kristofferson

PDF

Open Access

TL;DR

This paper explores the development, adoption, and societal implications of Training Data Attribution (TDA) tools for AI, emphasizing their potential to reduce risks and the challenges related to data disclosure.

Contribution

It analyzes the feasibility of creating effective TDA tools for large-scale LLMs and discusses policy and societal impacts of TDA adoption.

Findings

01

TDA tools could significantly reduce AI risks if developed effectively.

02

Willingness of AI labs to disclose training data is a key bottleneck.

03

TDA has potential to enable new policies and safety measures.

Abstract

This report investigates Training Data Attribution (TDA) and its potential importance to and tractability for reducing extreme risks from AI. First, we discuss the plausibility and amount of effort it would take to bring existing TDA research efforts from their current state, to an efficient and accurate tool for TDA inference that can be run on frontier-scale LLMs. Next, we discuss the numerous research benefits AI labs will expect to see from using such TDA tooling. Then, we discuss a key outstanding bottleneck that would limit such TDA tooling from being accessible publicly: AI labs' willingness to disclose their training data. We suggest ways AI labs may work around these limitations, and discuss the willingness of governments to mandate such access. Assuming that AI labs willingly provide access to TDA inference, we then discuss what high-level societal benefits you might see. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Quality and Management