Fingerprinting Encrypted Voice Traffic on Smart Speakers with Deep   Learning

Chenggang Wang; Sean Kennedy; Haipeng Li; King Hudson; Gowtham Atluri,; Xuetao Wei; Wenhai Sun; Boyang Wang

arXiv:2005.09800·cs.CR·May 21, 2020·5 cites

Fingerprinting Encrypted Voice Traffic on Smart Speakers with Deep Learning

Chenggang Wang, Sean Kennedy, Haipeng Li, King Hudson, Gowtham Atluri,, Xuetao Wei, Wenhai Sun, Boyang Wang

PDF

Open Access 1 Repo

TL;DR

This study demonstrates that deep learning can effectively fingerprint encrypted voice traffic from smart speakers, revealing user commands with high accuracy and proposing a defense to mitigate this privacy risk.

Contribution

The paper introduces a novel deep learning-based attack on encrypted voice traffic of smart speakers and proposes an effective traffic obfuscation defense.

Findings

01

Attack achieves 92.89% accuracy on Amazon Echo

02

Defense reduces attack accuracy to 32.18%

03

Effective fingerprinting even with only incoming traffic

Abstract

This paper investigates the privacy leakage of smart speakers under an encrypted traffic analysis attack, referred to as voice command fingerprinting. In this attack, an adversary can eavesdrop both outgoing and incoming encrypted voice traffic of a smart speaker, and infers which voice command a user says over encrypted traffic. We first built an automatic voice traffic collection tool and collected two large-scale datasets on two smart speakers, Amazon Echo and Google Home. Then, we implemented proof-of-concept attacks by leveraging deep learning. Our experimental results over the two datasets indicate disturbing privacy concerns. Specifically, compared to 1% accuracy with random guess, our attacks can correctly infer voice commands over encrypted traffic with 92.89\% accuracy on Amazon Echo. Despite variances that human voices may cause on outgoing traffic, our proof-of-concept…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SmartHomePrivacyProject/DeepVCFingerprinting
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInternet Traffic Analysis and Secure E-voting · Adversarial Robustness in Machine Learning · Hate Speech and Cyberbullying Detection