Robust and Verifiable Information Embedding Attacks to Deep Neural Networks via Error-Correcting Codes
Jinyuan Jia, Binghui Wang, Neil Zhenqiang Gong

TL;DR
This paper introduces a new information embedding attack on deep neural networks that is both verifiable and robust, utilizing error-correcting codes and adaptive querying to reliably recover embedded messages even after post-processing.
Contribution
The work develops a novel attack method combining Turbo codes and CRC for verifiable, robust message embedding and recovery in DNNs, addressing limitations of prior attacks.
Findings
Successfully verifies message correctness using CRC.
Achieves robustness against 8 different post-processing methods.
Accurately recovers embedded messages in all tested scenarios.
Abstract
In the era of deep learning, a user often leverages a third-party machine learning tool to train a deep neural network (DNN) classifier and then deploys the classifier as an end-user software product or a cloud service. In an information embedding attack, an attacker is the provider of a malicious third-party machine learning tool. The attacker embeds a message into the DNN classifier during training and recovers the message via querying the API of the black-box classifier after the user deploys it. Information embedding attacks have attracted growing attention because of various applications such as watermarking DNN classifiers and compromising user privacy. State-of-the-art information embedding attacks have two key limitations: 1) they cannot verify the correctness of the recovered message, and 2) they are not robust against post-processing of the classifier. In this work, we aim…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Internet Traffic Analysis and Secure E-voting · Wireless Signal Modulation Classification
