Adjust-free adversarial example generation in speech recognition using   evolutionary multi-objective optimization under black-box condition

Shoma Ishida; Satoshi Ono

arXiv:2012.11138·cs.SD·July 9, 2024

Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition

Shoma Ishida, Satoshi Ono

PDF

TL;DR

This paper introduces a black-box adversarial attack method for speech recognition that uses evolutionary multi-objective optimization to generate robust, adjust-free adversarial examples unaffected by timing lag.

Contribution

It presents a novel EMO-based approach for creating robust adversarial speech examples that do not require timing adjustments, improving attack robustness under black-box conditions.

Findings

01

Successfully generated adjust-free adversarial examples.

02

Examples are robust against timing lag.

03

Method outperforms previous approaches in robustness.

Abstract

This paper proposes a black-box adversarial attack method to automatic speech recognition systems. Some studies have attempted to attack neural networks for speech recognition; however, these methods did not consider the robustness of generated adversarial examples against timing lag with a target speech. The proposed method in this paper adopts Evolutionary Multi-objective Optimization (EMO)that allows it generating robust adversarial examples under black-box scenario. Experimental results showed that the proposed method successfully generated adjust-free adversarial examples, which are sufficiently robust against timing lag so that an attacker does not need to take the timing of playing it against the target speech.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.