Oedipus: LLM-enchanced Reasoning CAPTCHA Solver

Gelei Deng; Haoran Ou; Yi Liu; Jie Zhang; Tianwei Zhang; Yang Liu

arXiv:2405.07496·cs.CR·May 14, 2024

Oedipus: LLM-enchanced Reasoning CAPTCHA Solver

Gelei Deng, Haoran Ou, Yi Liu, Jie Zhang, Tianwei Zhang, Yang Liu

PDF

Open Access

TL;DR

This paper introduces Oedipus, a novel framework that leverages a domain-specific language and chain-of-thought reasoning to improve AI's ability to solve complex reasoning CAPTCHAs, revealing new security challenges.

Contribution

The paper presents a new end-to-end reasoning CAPTCHA solver using a DSL and sequential sub-steps, enhancing AI attack capabilities against modern CAPTCHA designs.

Findings

01

Oedipus achieves 63.5% success rate on complex CAPTCHAs.

02

LLMs struggle with reasoning CAPTCHAs without specialized frameworks.

03

Oedipus adapts to new CAPTCHA designs introduced in late 2023.

Abstract

CAPTCHAs have become a ubiquitous tool in safeguarding applications from automated bots. Over time, the arms race between CAPTCHA development and evasion techniques has led to increasingly sophisticated and diverse designs. The latest iteration, reasoning CAPTCHAs, exploits tasks that are intuitively simple for humans but challenging for conventional AI technologies, thereby enhancing security measures. Driven by the evolving AI capabilities, particularly the advancements in Large Language Models (LLMs), we investigate the potential of multimodal LLMs to solve modern reasoning CAPTCHAs. Our empirical analysis reveals that, despite their advanced reasoning capabilities, LLMs struggle to solve these CAPTCHAs effectively. In response, we introduce Oedipus, an innovative end-to-end framework for automated reasoning CAPTCHA solving. Central to this framework is a novel strategy that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUser Authentication and Security Systems · Spam and Phishing Detection · Advanced Malware Detection Techniques