Malware Analysis with Symbolic Execution and Graph Kernel
Charles-Henry Bertrand Van Ouytsel, Axel Legay

TL;DR
This paper introduces a new malware analysis method combining symbolic execution with graph kernel techniques, specifically the Weisfeiler-Lehman kernel, to improve classification accuracy and efficiency over existing graph-based approaches.
Contribution
The authors develop an efficient open source toolchain utilizing graph kernels for malware classification, addressing limitations of previous graph-based machine learning methods.
Findings
Outperforms existing malware classification methods significantly
Uses Weisfeiler-Lehman kernel to capture local graph similarities
Demonstrates improved efficiency and accuracy in malware detection
Abstract
Malware analysis techniques are divided into static and dynamic analysis. Both techniques can be bypassed by circumvention techniques such as obfuscation. In a series of works, the authors have promoted the use of symbolic executions combined with machine learning to avoid such traps. Most of those works rely on natural graph-based representations that can then be plugged into graph-based learning algorithms such as Gspan. There are two main problems with this approach. The first one is in the cost of computing the graph. Indeed, working with graphs requires one to compute and representing the entire state-space of the file under analysis. As such computation is too cumbersome, the techniques often rely on developing strategies to compute a representative subgraph of the behaviors. Unfortunately, efficient graph-building strategies remain weakly explored. The second problem is in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Software Testing and Debugging Techniques · Digital and Cyber Forensics
