Robust CAPTCHA Using Audio Illusions in the Era of Large Language Models: from Evaluation to Advances

Ziqi Ding; Yunfeng Wan; Wei Song; Yi Liu; Gelei Deng; Nan Sun; Huadong Mo; Jingling Xue; Shidong Pan; Yuekang Li

arXiv:2601.08516·cs.SD·January 14, 2026

Robust CAPTCHA Using Audio Illusions in the Era of Large Language Models: from Evaluation to Advances

Ziqi Ding, Yunfeng Wan, Wei Song, Yi Liu, Gelei Deng, Nan Sun, Huadong Mo, Jingling Xue, Shidong Pan, Yuekang Li

PDF

Open Access

TL;DR

This paper evaluates the vulnerability of existing audio CAPTCHAs to advanced AI models and introduces IllusionAudio, a new approach leveraging audio illusions that is robust against such attacks and maintains high human usability.

Contribution

The paper presents a comprehensive evaluation framework for audio CAPTCHA security and introduces IllusionAudio, a novel CAPTCHA method using audio illusions to enhance robustness against AI-based attacks.

Findings

01

Most existing audio CAPTCHAs are vulnerable to LALM and ASR attacks.

02

IllusionAudio defeats all tested AI attacks while maintaining 100% human pass rate.

03

The evaluation framework ACEval effectively measures CAPTCHA robustness.

Abstract

CAPTCHAs are widely used by websites to block bots and spam by presenting challenges that are easy for humans but difficult for automated programs to solve. To improve accessibility, audio CAPTCHAs are designed to complement visual ones. However, the robustness of audio CAPTCHAs against advanced Large Audio Language Models (LALMs) and Automatic Speech Recognition (ASR) models remains unclear. In this paper, we introduce AI-CAPTCHA, a unified framework that offers (i) an evaluation framework, ACEval, which includes advanced LALM- and ASR-based solvers, and (ii) a novel audio CAPTCHA approach, IllusionAudio, leveraging audio illusions. Through extensive evaluations of seven widely deployed audio CAPTCHAs, we show that most existing methods can be solved with high success rates by advanced LALMs and ASR models, exposing critical security weaknesses. To address these vulnerabilities, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUser Authentication and Security Systems · AI in Service Interactions · Hate Speech and Cyberbullying Detection