BDefects4NN: A Backdoor Defect Database for Controlled Localization   Studies in Neural Networks

Yisong Xiao; Aishan Liu; Xinwei Zhang; Tianyuan Zhang; Tianlin Li,; Siyuan Liang; Xianglong Liu; Yang Liu; Dacheng Tao

arXiv:2412.00746·cs.SE·December 3, 2024

BDefects4NN: A Backdoor Defect Database for Controlled Localization Studies in Neural Networks

Yisong Xiao, Aishan Liu, Xinwei Zhang, Tianyuan Zhang, Tianlin Li,, Siyuan Liang, Xianglong Liu, Yang Liu, Dacheng Tao

PDF

Open Access

TL;DR

This paper introduces BDefects4NN, a comprehensive database of backdoor-defected neural networks, enabling controlled localization studies to improve detection and repair of malicious model defects in critical AI systems.

Contribution

It presents the first backdoor defect database with labeled defects at neuron granularity, facilitating research on defect localization and mitigation in neural networks.

Findings

01

Limited effectiveness of existing fault localization criteria for backdoor defects

02

Backdoor models pose significant threats in autonomous driving and LLMs

03

Current defect localization techniques need improvement

Abstract

Pre-trained large deep learning models are now serving as the dominant component for downstream middleware users and have revolutionized the learning paradigm, replacing the traditional approach of training from scratch locally. To reduce development costs, developers often integrate third-party pre-trained deep neural networks (DNNs) into their intelligent software systems. However, utilizing untrusted DNNs presents significant security risks, as these models may contain intentional backdoor defects resulting from the black-box training process. These backdoor defects can be activated by hidden triggers, allowing attackers to maliciously control the model and compromise the overall reliability of the intelligent software. To ensure the safe adoption of DNNs in critical software systems, it is crucial to establish a backdoor defect database for localization studies. This paper addresses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection · Integrated Circuits and Semiconductor Failure Analysis · Adversarial Robustness in Machine Learning