Hide in Plain Sight: Clean-Label Backdoor for Auditing Membership Inference
Depeng Chen, Hao Chen, Hulin Jin, Jie Cui, Hong Zhong

TL;DR
This paper introduces a stealthy, clean-label backdoor method for membership inference attacks that enables effective data auditing without detectable label alterations, enhancing privacy assessment tools.
Contribution
The paper presents a novel clean-label backdoor approach for MIAs that improves stealthiness and robustness, using an optimal trigger generated by a shadow model to mimic target model behavior.
Findings
Achieves high attack success rates across datasets and models
Outperforms baseline methods in stealth and effectiveness
Maintains natural labels, avoiding visual artifacts
Abstract
Membership inference attacks (MIAs) are critical tools for assessing privacy risks and ensuring compliance with regulations like the General Data Protection Regulation (GDPR). However, their potential for auditing unauthorized use of data remains under explored. To bridge this gap, we propose a novel clean-label backdoor-based approach for MIAs, designed specifically for robust and stealthy data auditing. Unlike conventional methods that rely on detectable poisoned samples with altered labels, our approach retains natural labels, enhancing stealthiness even at low poisoning rates. Our approach employs an optimal trigger generated by a shadow model that mimics the target model's behavior. This design minimizes the feature-space distance between triggered samples and the source class while preserving the original data labels. The result is a powerful and undetectable auditing mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Reporting and XBRL
