Mellivora Capensis: A Backdoor-Free Training Framework on the Poisoned Dataset without Auxiliary Data
Yuwen Pu, Jiahao Chen, Chunyi Zhou, Zhou Feng, Qingming Li, Chunqiang Hu, Shouling Ji

TL;DR
This paper introduces Mellivora Capensis, a novel training framework that defends against backdoor attacks in poisoned datasets without requiring auxiliary clean data, enhancing security in real-world data collection scenarios.
Contribution
The paper presents a theoretical analysis linking perturbations to backdoor triggers and proposes a robust, clean-data-free defense framework for backdoor mitigation.
Findings
Poisoned samples show greater robustness to perturbations than clean samples.
The proposed framework effectively trains models free of backdoors on poisoned datasets.
The method outperforms existing defenses in practical scenarios.
Abstract
The efficacy of deep learning models is profoundly influenced by the quality of their training data. Given the considerations of data diversity, data scale, and annotation expenses, model trainers frequently resort to sourcing and acquiring datasets from online repositories. Although economically pragmatic, this strategy exposes the models to substantial security vulnerabilities. Untrusted entities can clandestinely embed triggers within the dataset, facilitating the hijacking of the trained model on the poisoned dataset through backdoor attacks, which constitutes a grave security concern. Despite the proliferation of countermeasure research, their inherent limitations constrain their effectiveness in practical applications. These include the requirement for substantial quantities of clean samples, inconsistent defense performance across varying attack scenarios, and inadequate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Computational Drug Discovery Methods
