Enhancing AI-based Generation of Software Exploits with Contextual   Information

Pietro Liguori; Cristina Improta; Roberto Natella; Bojan Cukic and; Domenico Cotroneo

arXiv:2408.02402·cs.SE·September 9, 2024

Enhancing AI-based Generation of Software Exploits with Contextual Information

Pietro Liguori, Cristina Improta, Roberto Natella, Bojan Cukic and, Domenico Cotroneo

PDF

Open Access

TL;DR

This study evaluates how contextual information affects neural machine translation models' ability to generate offensive security code from natural language descriptions, highlighting the importance of optimal context for accuracy.

Contribution

It demonstrates the impact of contextual data on model performance and identifies an optimal level of context for generating accurate security exploits.

Findings

01

Context improves model performance significantly.

02

Additional context benefits diminish after a certain point.

03

Models can filter out irrelevant information effectively.

Abstract

This practical experience report explores Neural Machine Translation (NMT) models' capability to generate offensive security code from natural language (NL) descriptions, highlighting the significance of contextual understanding and its impact on model performance. Our study employs a dataset comprising real shellcodes to evaluate the models across various scenarios, including missing information, necessary context, and unnecessary context. The experiments are designed to assess the models' resilience against incomplete descriptions, their proficiency in leveraging context for enhanced accuracy, and their ability to discern irrelevant information. The findings reveal that the introduction of contextual data significantly improves performance. However, the benefits of additional context diminish beyond a certain point, indicating an optimal level of contextual information for model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Software Reliability and Analysis Research · Network Security and Intrusion Detection