Cache Telepathy: Leveraging Shared Resource Attacks to Learn DNN   Architectures

Mengjia Yan; Christopher Fletcher; Josep Torrellas

arXiv:1808.04761·cs.DC·August 15, 2018·44 cites

Cache Telepathy: Leveraging Shared Resource Attacks to Learn DNN Architectures

Mengjia Yan, Christopher Fletcher, Josep Torrellas

PDF

Open Access

TL;DR

This paper introduces Cache Telepathy, a cache side-channel attack that accurately infers DNN architectures by analyzing GEMM operations, significantly narrowing the search space for model identification.

Contribution

It presents a novel cache side-channel method to extract DNN architecture details, leveraging the dependence of inference on GEMM parameters, which was not previously exploited.

Findings

01

Successfully reduces architecture search space from over 10^35 to 16 for VGG with OpenBLAS.

02

Effective in attacking VGG and ResNet models using Prime+Probe and Flush+Reload.

03

Leverages the dependence of GEMM calls on DNN architecture parameters.

Abstract

Deep Neural Networks (DNNs) are fast becoming ubiquitous for their ability to attain good accuracy in various machine learning tasks. A DNN's architecture (i.e., its hyper-parameters) broadly determines the DNN's accuracy and performance, and is often confidential. Attacking a DNN in the cloud to obtain its architecture can potentially provide major commercial value. Further, attaining a DNN's architecture facilitates other, existing DNN attacks. This paper presents Cache Telepathy: a fast and accurate mechanism to steal a DNN's architecture using the cache side channel. Our attack is based on the insight that DNN inference relies heavily on tiled GEMM (Generalized Matrix Multiply), and that DNN architecture parameters determine the number of GEMM calls and the dimensions of the matrices used in the GEMM functions. Such information can be leaked through the cache side channel. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Memory and Neural Computing · Security and Verification in Computing

MethodsAverage Pooling · Dropout · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Kaiming Initialization