Robust and Verifiable Information Embedding Attacks to Deep Neural   Networks via Error-Correcting Codes

Jinyuan Jia; Binghui Wang; Neil Zhenqiang Gong

arXiv:2010.13751·cs.CR·October 27, 2020

Robust and Verifiable Information Embedding Attacks to Deep Neural Networks via Error-Correcting Codes

Jinyuan Jia, Binghui Wang, Neil Zhenqiang Gong

PDF

Open Access

TL;DR

This paper introduces a new information embedding attack on deep neural networks that is both verifiable and robust, utilizing error-correcting codes and adaptive querying to reliably recover embedded messages even after post-processing.

Contribution

The work develops a novel attack method combining Turbo codes and CRC for verifiable, robust message embedding and recovery in DNNs, addressing limitations of prior attacks.

Findings

01

Successfully verifies message correctness using CRC.

02

Achieves robustness against 8 different post-processing methods.

03

Accurately recovers embedded messages in all tested scenarios.

Abstract

In the era of deep learning, a user often leverages a third-party machine learning tool to train a deep neural network (DNN) classifier and then deploys the classifier as an end-user software product or a cloud service. In an information embedding attack, an attacker is the provider of a malicious third-party machine learning tool. The attacker embeds a message into the DNN classifier during training and recovers the message via querying the API of the black-box classifier after the user deploys it. Information embedding attacks have attracted growing attention because of various applications such as watermarking DNN classifiers and compromising user privacy. State-of-the-art information embedding attacks have two key limitations: 1) they cannot verify the correctness of the recovered message, and 2) they are not robust against post-processing of the classifier. In this work, we aim…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Internet Traffic Analysis and Secure E-voting · Wireless Signal Modulation Classification