Enhancing AI-based Generation of Software Exploits with Contextual Information
Pietro Liguori, Cristina Improta, Roberto Natella, Bojan Cukic and, Domenico Cotroneo

TL;DR
This study evaluates how contextual information affects neural machine translation models' ability to generate offensive security code from natural language descriptions, highlighting the importance of optimal context for accuracy.
Contribution
It demonstrates the impact of contextual data on model performance and identifies an optimal level of context for generating accurate security exploits.
Findings
Context improves model performance significantly.
Additional context benefits diminish after a certain point.
Models can filter out irrelevant information effectively.
Abstract
This practical experience report explores Neural Machine Translation (NMT) models' capability to generate offensive security code from natural language (NL) descriptions, highlighting the significance of contextual understanding and its impact on model performance. Our study employs a dataset comprising real shellcodes to evaluate the models across various scenarios, including missing information, necessary context, and unnecessary context. The experiments are designed to assess the models' resilience against incomplete descriptions, their proficiency in leveraging context for enhanced accuracy, and their ability to discern irrelevant information. The findings reveal that the introduction of contextual data significantly improves performance. However, the benefits of additional context diminish beyond a certain point, indicating an optimal level of contextual information for model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Software Reliability and Analysis Research · Network Security and Intrusion Detection
