Towards Provably Unlearnable Examples via Bayes Error Optimisation

Ruihan Zhang; Jun Sun; Ee-Peng Lim; Peixin Zhang

arXiv:2511.08191·cs.AI·November 12, 2025

Towards Provably Unlearnable Examples via Bayes Error Optimisation

Ruihan Zhang, Jun Sun, Ee-Peng Lim, Peixin Zhang

PDF

Open Access

TL;DR

This paper introduces a formal, optimization-based method to create unlearnable data examples by maximising Bayes error, ensuring they remain unlearnable even when mixed with clean data, with proven guarantees and practical effectiveness.

Contribution

It proposes a novel, theoretically grounded approach to generate unlearnable examples by maximising Bayes error using projected gradient ascent, addressing limitations of heuristic methods.

Findings

01

Method provably increases Bayes error in data.

02

Effective in preventing model learning when mixed with clean data.

03

Consistent results across multiple datasets and models.

Abstract

The recent success of machine learning models, especially large-scale classifiers and language models, relies heavily on training with massive data. These data are often collected from online sources. This raises serious concerns about the protection of user data, as individuals may not have given consent for their data to be used in training. To address this concern, recent studies introduce the concept of unlearnable examples, i.e., data instances that appear natural but are intentionally altered to prevent models from effectively learning from them. While existing methods demonstrate empirical effectiveness, they typically rely on heuristic trials and lack formal guarantees. Besides, when unlearnable examples are mixed with clean data, as is often the case in practice, their unlearnability disappears. In this work, we propose a novel approach to constructing unlearnable examples by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Ethics and Social Impacts of AI · Artificial Intelligence in Healthcare and Education