Deep-Learning-based Vulnerability Detection in Binary Executables
Andreas Schaad, Dominik Binder

TL;DR
This paper presents a deep learning approach using recurrent neural networks to detect and classify 23 different vulnerabilities in binary executables with high accuracy, expanding beyond previous methods limited to 4 vulnerability types.
Contribution
The study introduces a supervised deep learning method that significantly broadens vulnerability detection capabilities in binary code, achieving high accuracy for multiple vulnerability types.
Findings
Detected 23 vulnerabilities with high accuracy
Achieved 88% accuracy for multi-class classification
Non-vulnerable samples detected with over 98% precision
Abstract
The identification of vulnerabilities is an important element in the software development life cycle to ensure the security of software. While vulnerability identification based on the source code is a well studied field, the identification of vulnerabilities on basis of a binary executable without the corresponding source code is more challenging. Recent research [1] has shown, how such detection can be achieved by deep learning methods. However, that particular approach is limited to the identification of only 4 types of vulnerabilities. Subsequently, we analyze to what extent we could cover the identification of a larger variety of vulnerabilities. Therefore, a supervised deep learning approach using recurrent neural networks for the application of vulnerability detection based on binary executables is used. The underlying basis is a dataset with 50,651 samples of vulnerable code in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Advanced Malware Detection Techniques
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
