Architectural Backdoors for Within-Batch Data Stealing and Model Inference Manipulation
Nicolas K\"uchler, Ivan Petrov, Conrad Grobler, Ilia Shumailov

TL;DR
This paper introduces architectural backdoors exploiting batching in neural networks to enable data theft and inference manipulation, demonstrating their feasibility and proposing a formal mitigation strategy.
Contribution
The paper presents a new class of architectural backdoors targeting batching processes, along with a formal Information Flow Control mitigation method.
Findings
Over 200 models exhibit unintended information leakage due to dynamic quantization.
Architectural backdoors can be injected into common models like Transformers.
Mitigation guarantees non-interference between user requests within a batch.
Abstract
For nearly a decade the academic community has investigated backdoors in neural networks, primarily focusing on classification tasks where adversaries manipulate the model prediction. While demonstrably malicious, the immediate real-world impact of such prediction-altering attacks has remained unclear. In this paper we introduce a novel and significantly more potent class of backdoors that builds upon recent advancements in architectural backdoors. We demonstrate how these backdoors can be specifically engineered to exploit batched inference, a common technique for hardware utilization, enabling large-scale user data manipulation and theft. By targeting the batching process, these architectural backdoors facilitate information leakage between concurrent user requests and allow attackers to fully control model responses directed at other users within the same batch. In other words, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Data Storage Technologies · Neural Networks and Applications
MethodsSparse Evolutionary Training
