Next-Gen CAPTCHAs: Leveraging the Cognitive Gap for Scalable and Diverse GUI-Agent Defense

Jiacheng Liu, Yaxin Luo, Jiacheng Cui, Xinyi Shang, Xiaohan Zhao, and Zhiqiang Shen

arXiv:2602.09012 · cs.LG · February 10, 2026

Open Access

TL;DR

This paper introduces Next-Gen CAPTCHAs that leverage the cognitive gap between humans and AI to provide scalable, dynamic, and robust defenses against advanced GUI-enabled agents capable of solving complex logic puzzles.

Contribution

It presents a novel CAPTCHA framework that uses dynamic, reasoning-based tasks to differentiate humans from AI, supported by a scalable data generation pipeline for large-scale evaluation.

Findings

01

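Showed that recent reasoning-heavy models such as Gemini3-Pro-High and GPT-5.2-Xhigh reach pass rates as high as 90% on complex logic puzzles like "Bingo" from earlier CAPTCHA benchmarks.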

02

Developed a scalable CAPTCHA generation pipeline that, for backend-supported types, can produce effectively unbounded instances.

03

Demonstrated robustness of the cognitive gap approach against modern AI agents.

Abstract

The rapid evolution of GUI-enabled agents has rendered traditional CAPTCHAs obsolete. While previous benchmarks like OpenCaptchaWorld established a baseline for evaluating multimodal agents, recent advancements in reasoning-heavy models, such as Gemini3-Pro-High and GPT-5.2-Xhigh, have effectively collapsed this security barrier, achieving pass rates as high as 90% on complex logic puzzles like "Bingo". In response, we introduce Next-Gen CAPTCHAs, a scalable defense framework designed to secure the next-generation web against advanced agents. Unlike static datasets, our benchmark is built upon a robust data generation pipeline, allowing for large-scale and easily scalable evaluations. Notably, for backend-supported types, our system is capable of generating effectively unbounded CAPTCHA instances. We exploit the persistent human-agent "Cognitive Gap" in interactive perception,…

Taxonomy

Topics: User Authentication and Security Systems · Advanced Malware Detection Techniques · AI in Service Interactions