ASRJam: Human-Friendly AI Speech Jamming to Prevent Automated Phone Scams

Freddie Grabovski; Gilad Gressel; Yisroel Mirsky

arXiv:2506.11125·cs.CL·June 16, 2025

ASRJam: Human-Friendly AI Speech Jamming to Prevent Automated Phone Scams

Freddie Grabovski, Gilad Gressel, Yisroel Mirsky

PDF

Open Access

TL;DR

This paper introduces ASRJam, a novel audio jamming framework that disrupts automated voice scams by injecting adversarial perturbations, using natural distortions like reverberation to effectively hinder ASR systems while remaining human-friendly.

Contribution

The paper presents a new proactive defense method against voice scams that employs natural distortions to disrupt ASR without affecting human communication, outperforming existing adversarial techniques.

Findings

01

EchoGuard achieved the highest utility in disrupting ASR.

02

Natural distortions are effective and tolerable for humans.

03

User study confirmed EchoGuard's superior performance.

Abstract

Large Language Models (LLMs), combined with Text-to-Speech (TTS) and Automatic Speech Recognition (ASR), are increasingly used to automate voice phishing (vishing) scams. These systems are scalable and convincing, posing a significant security threat. We identify the ASR transcription step as the most vulnerable link in the scam pipeline and introduce ASRJam, a proactive defence framework that injects adversarial perturbations into the victim's audio to disrupt the attacker's ASR. This breaks the scam's feedback loop without affecting human callers, who can still understand the conversation. While prior adversarial audio techniques are often unpleasant and impractical for real-time use, we also propose EchoGuard, a novel jammer that leverages natural distortions, such as reverberation and echo, that are disruptive to ASR but tolerable to humans. To evaluate EchoGuard's effectiveness and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUser Authentication and Security Systems · Network Security and Intrusion Detection · Speech Recognition and Synthesis