Oedipus: LLM-enchanced Reasoning CAPTCHA Solver
Gelei Deng, Haoran Ou, Yi Liu, Jie Zhang, Tianwei Zhang, Yang Liu

TL;DR
This paper introduces Oedipus, a novel framework that leverages a domain-specific language and chain-of-thought reasoning to improve AI's ability to solve complex reasoning CAPTCHAs, revealing new security challenges.
Contribution
The paper presents a new end-to-end reasoning CAPTCHA solver using a DSL and sequential sub-steps, enhancing AI attack capabilities against modern CAPTCHA designs.
Findings
Oedipus achieves 63.5% success rate on complex CAPTCHAs.
LLMs struggle with reasoning CAPTCHAs without specialized frameworks.
Oedipus adapts to new CAPTCHA designs introduced in late 2023.
Abstract
CAPTCHAs have become a ubiquitous tool in safeguarding applications from automated bots. Over time, the arms race between CAPTCHA development and evasion techniques has led to increasingly sophisticated and diverse designs. The latest iteration, reasoning CAPTCHAs, exploits tasks that are intuitively simple for humans but challenging for conventional AI technologies, thereby enhancing security measures. Driven by the evolving AI capabilities, particularly the advancements in Large Language Models (LLMs), we investigate the potential of multimodal LLMs to solve modern reasoning CAPTCHAs. Our empirical analysis reveals that, despite their advanced reasoning capabilities, LLMs struggle to solve these CAPTCHAs effectively. In response, we introduce Oedipus, an innovative end-to-end framework for automated reasoning CAPTCHA solving. Central to this framework is a novel strategy that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUser Authentication and Security Systems · Spam and Phishing Detection · Advanced Malware Detection Techniques
