Black-box Adversarial Attacks on Commercial Speech Platforms with Minimal Information
Baolin Zheng, Peipei Jiang, Qian Wang, Qi Li, Chao Shen, Cong Wang,, Yunjie Ge, Qingyang Teng, Shenyi Zhang

TL;DR
This paper introduces novel black-box adversarial attack methods for commercial speech platforms, achieving high success rates without relying on confidence scores or internal model access, highlighting vulnerabilities in popular speech APIs and devices.
Contribution
It proposes Occam, a decision-only black-box attack for speech APIs, and NI-Occam, a non-interactive physical attack for voice devices, both demonstrating effective adversarial examples in practical scenarios.
Findings
Occam achieves 100% success rate on multiple speech APIs.
NI-Occam successfully fools major voice assistants with high transferability.
Both methods operate effectively without internal model knowledge or interaction.
Abstract
Adversarial attacks against commercial black-box speech platforms, including cloud speech APIs and voice control devices, have received little attention until recent years. The current "black-box" attacks all heavily rely on the knowledge of prediction/confidence scores to craft effective adversarial examples, which can be intuitively defended by service providers without returning these messages. In this paper, we propose two novel adversarial attacks in more practical and rigorous scenarios. For commercial cloud speech APIs, we propose Occam, a decision-only black-box adversarial attack, where only final decisions are available to the adversary. In Occam, we formulate the decision-only AE generation as a discontinuous large-scale global optimization problem, and solve it by adaptively decomposing this complicated problem into a set of sub-problems and cooperatively optimizing each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james · Autoencoders
