From Rookie to Expert: Manipulating LLMs for Automated Vulnerability Exploitation in Enterprise Software

Moustapha Awwalou Diouf; Maimouna Tamah Diao; Iyiola Emmanuel Olatunji; Abdoul Kader Kabor\'e; Jordan Samhi; Gervais Mendy; Samuel Ouya; Jacques Klein; Tegawend\'e F. Bissyand\'e

arXiv:2512.22753·cs.SE·April 24, 2026

From Rookie to Expert: Manipulating LLMs for Automated Vulnerability Exploitation in Enterprise Software

Moustapha Awwalou Diouf, Maimouna Tamah Diao, Iyiola Emmanuel Olatunji, Abdoul Kader Kabor\'e, Jordan Samhi, Gervais Mendy, Samuel Ouya, Jacques Klein, Tegawend\'e F. Bissyand\'e

PDF

1 Repo

TL;DR

This paper demonstrates how publicly available large language models can be socially engineered to generate functional exploits for enterprise software vulnerabilities, challenging traditional security assumptions.

Contribution

It introduces the RSA strategy to manipulate LLMs into creating exploits, showing that non-technical actors can bypass security barriers easily.

Findings

01

All tested LLMs exploited every CVE within 3-5 prompts.

02

Eliminates manual effort previously needed for LLM-assisted attacks.

03

Invalidates core security principles by enabling non-technical exploitation.

Abstract

LLMs democratize software engineering by enabling non-programmers to create applications, but this same accessibility fundamentally undermines security assumptions that have guided software engineering for decades. We show in this work how publicly available LLMs can be socially engineered to transform novices into capable attackers, challenging the foundational principle that exploitation requires technical expertise. To that end, we propose RSA (Role-assignment, Scenario-pretexting, and Action-solicitation), a pretexting strategy that manipulates LLMs into generating functional exploits despite their safety mechanisms. Testing against Odoo -- a widely used ERP platform, we evaluated five mainstream LLMs (GPT-4o, Gemini, Claude, Microsoft Copilot, and DeepSeek) and successfully exploited every tested CVE: at least one LLM produced a functional exploit for each within 3-5 prompting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://anonymous.4open.science/r/From-Rookie-to-Attacker-D8B3
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.