VPN: Verification of Poisoning in Neural Networks

Youcheng Sun; Muhammad Usman; Divya Gopinath; Corina S.; P\u{a}s\u{a}reanu

arXiv:2205.03894·cs.CR·May 10, 2022

VPN: Verification of Poisoning in Neural Networks

Youcheng Sun, Muhammad Usman, Divya Gopinath, Corina S., P\u{a}s\u{a}reanu

PDF

Open Access

TL;DR

This paper introduces a verification method for detecting data poisoning in neural networks by formulating poisoning checks as properties verifiable with existing tools, enabling detection of triggers that cause misclassification.

Contribution

It presents a novel approach to verify data poisoning in neural networks using formal tools, including transferability of triggers across models and applicability to real-world image classifiers.

Findings

01

Verification of poisoning triggers is feasible with existing tools.

02

Discovered triggers transfer from small to large models.

03

Applicable to state-of-the-art image classification models.

Abstract

Neural networks are successfully used in a variety of applications, many of them having safety and security concerns. As a result researchers have proposed formal verification techniques for verifying neural network properties. While previous efforts have mainly focused on checking local robustness in neural networks, we instead study another neural network security issue, namely data poisoning. In this case an attacker inserts a trigger into a subset of the training data, in such a way that at test time, this trigger in an input causes the trained model to misclassify to some target class. We show how to formulate the check for data poisoning as a property that can be checked with off-the-shelf verification tools, such as Marabou and nneum, where counterexamples of failed checks constitute the triggers. We further show that the discovered triggers are `transferable' from a small model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)