Model-agnostic clean-label backdoor mitigation in cybersecurity   environments

Giorgio Severi; Simona Boboila; John Holodnak; Kendra Kratkiewicz,; Rauf Izmailov; Michael J. De Lucia; Alina Oprea

arXiv:2407.08159·cs.CR·May 6, 2025·1 cites

Model-agnostic clean-label backdoor mitigation in cybersecurity environments

Giorgio Severi, Simona Boboila, John Holodnak, Kendra Kratkiewicz,, Rauf Izmailov, Michael J. De Lucia, Alina Oprea

PDF

Open Access

TL;DR

This paper introduces a novel, model-agnostic method for mitigating clean-label backdoor attacks in cybersecurity machine learning models by using density-based clustering and iterative scoring, effective across different data types and models.

Contribution

The work presents a new defense technique that does not rely on common assumptions, effectively mitigating clean-label poisoning attacks in cybersecurity contexts.

Findings

01

Effective mitigation of clean-label backdoor attacks in cybersecurity models

02

Applicable to multiple data modalities and model types

03

Preserves model utility while enhancing security

Abstract

The training phase of machine learning models is a delicate step, especially in cybersecurity contexts. Recent research has surfaced a series of insidious training-time attacks that inject backdoors in models designed for security classification tasks without altering the training labels. With this work, we propose new techniques that leverage insights in cybersecurity threat models to effectively mitigate these clean-label poisoning attacks, while preserving the model utility. By performing density-based clustering on a carefully chosen feature subspace, and progressively isolating the suspicious clusters through a novel iterative scoring procedure, our defensive mechanism can mitigate the attacks without requiring many of the common assumptions in the existing backdoor defense literature. To show the generality of our proposed mitigation, we evaluate it on two clean-label…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSecurity and Verification in Computing · Safety Systems Engineering in Autonomy · Access Control and Trust