Evaluating and Improving the Robustness of Security Attack Detectors Generated by LLMs

Samuele Pasini; Jinhan Kim; Tommaso Aiello; Rocio Cabrera Lozoya; Antonino Sabetta; Paolo Tonella

arXiv:2411.18216·cs.SE·September 18, 2025

Evaluating and Improving the Robustness of Security Attack Detectors Generated by LLMs

Samuele Pasini, Jinhan Kim, Tommaso Aiello, Rocio Cabrera Lozoya, Antonino Sabetta, Paolo Tonella

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel approach combining Retrieval Augmented Generation and Self-Ranking to enhance the robustness and accuracy of security attack detectors generated by Large Language Models, significantly improving detection performance.

Contribution

It introduces an integrated pipeline using RAG and Self-Ranking to improve LLM-generated security detectors, addressing knowledge limitations and robustness challenges.

Findings

01

Up to 71%pt improvement in XSS detection F2-Score.

02

Up to 43%pt improvement in SQL injection detection F2-Score.

03

Significant overall enhancement in attack detection robustness.

Abstract

Large Language Models (LLMs) are increasingly used in software development to generate functions, such as attack detectors, that implement security requirements. A key challenge is ensuring the LLMs have enough knowledge to address specific security requirements, such as information about existing attacks. For this, we propose an approach integrating Retrieval Augmented Generation (RAG) and Self-Ranking into the LLM pipeline. RAG enhances the robustness of the output by incorporating external knowledge sources, while the Self-Ranking technique, inspired by the concept of Self-Consistency, generates multiple reasoning paths and creates ranks to select the most robust detector. Our extensive empirical study targets code generated by LLMs to detect two prevalent injection attacks in web security: Cross-Site Scripting (XSS) and SQL injection (SQLi). Results show a significant improvement in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PasiniSamuele/Robust-Attack-Detectors-LLM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Linear Warmup With Linear Decay · Layer Normalization · Byte Pair Encoding · Adam · Residual Connection · Weight Decay · Softmax