Data Contamination Calibration for Black-box LLMs

Wentao Ye; Jiaqi Hu; Liyao Li; Haobo Wang; Gang Chen; Junbo Zhao

arXiv:2405.11930·cs.LG·June 4, 2024

Data Contamination Calibration for Black-box LLMs

Wentao Ye, Jiaqi Hu, Liyao Li, Haobo Wang, Gang Chen, Junbo Zhao

PDF

Open Access 1 Repo 2 Datasets 1 Video

TL;DR

This paper introduces Polarized Augment Calibration (PAC), a novel, plug-and-play method for detecting and reducing data contamination in large language models, supported by a new dataset and extensive experiments.

Contribution

The work presents PAC, a new method extending membership inference attacks to detect training data contamination in LLMs, applicable to various models and datasets.

Findings

01

PAC outperforms existing methods by at least 4.5% in contamination detection

02

Effective across more than 4 dataset formats and 10 base LLMs

03

Reveals significant contamination issues in real-world scenarios

Abstract

The rapid advancements of Large Language Models (LLMs) tightly associate with the expansion of the training data size. However, the unchecked ultra-large-scale training sets introduce a series of potential risks like data contamination, i.e. the benchmark data is used for training. In this work, we propose a holistic method named Polarized Augment Calibration (PAC) along with a new to-be-released dataset to detect the contaminated data and diminish the contamination effect. PAC extends the popular MIA (Membership Inference Attack) -- from machine learning community -- by forming a more global target at detecting training data to Clarify invisible training data. As a pioneering work, PAC is very much plug-and-play that can be integrated with most (if not all) current white- and black-box LLMs. By extensive experiments, PAC outperforms existing methods by at least 4.5%, towards data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yyy01/pac
noneOfficial

Datasets

Videos

Data Contamination Calibration for Black-box LLMs· underline

Taxonomy

TopicsParticle Detector Development and Performance · Magnetic confinement fusion research · Particle Accelerators and Free-Electron Lasers

MethodsBalanced Selection