Multi-Faceted Studies on Data Poisoning can Advance LLM Development

Pengfei He; Yue Xing; Han Xu; Zhen Xiang; Jiliang Tang

arXiv:2502.14182·cs.CR·February 21, 2025

Multi-Faceted Studies on Data Poisoning can Advance LLM Development

Pengfei He, Yue Xing, Han Xu, Zhen Xiang, Jiliang Tang

PDF

Open Access 1 Repo

TL;DR

This paper argues that comprehensive studies on data poisoning can enhance LLM development by improving safety, robustness, and understanding of model mechanisms, despite practical challenges in executing such attacks.

Contribution

It redefines the role of data poisoning in LLMs, emphasizing its potential to improve safety, trustworthiness, and mechanistic insights through multi-faceted research.

Findings

01

Practical data poisoning attacks are challenging due to LLM training complexity.

02

Data poisoning can help identify and mitigate biases and harmful outputs.

03

Studying data poisoning offers deeper understanding of LLM data-model interactions.

Abstract

The lifecycle of large language models (LLMs) is far more complex than that of traditional machine learning models, involving multiple training stages, diverse data sources, and varied inference methods. While prior research on data poisoning attacks has primarily focused on the safety vulnerabilities of LLMs, these attacks face significant challenges in practice. Secure data collection, rigorous data cleaning, and the multistage nature of LLM training make it difficult to inject poisoned data or reliably influence LLM behavior as intended. Given these challenges, this position paper proposes rethinking the role of data poisoning and argue that multi-faceted studies on data poisoning can advance LLM development. From a threat perspective, practical strategies for data poisoning attacks can help evaluate and address real safety risks to LLMs. From a trustworthiness perspective, data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PengfeiHePower/awesome-LLM-data-poisoning
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management