Cross-Inlining Binary Function Similarity Detection
Ang Jia, Ming Fan, Xi Xu, Wuxia Jin, Haijun Wang, Ting Liu

TL;DR
This paper introduces CI-Detector, a pattern-based model that effectively detects cross-inlining binary function similarities by leveraging attributed CFGs and GNNs, outperforming existing methods in precision and recall.
Contribution
The paper systematically investigates cross-inlining function similarity detection and proposes a novel pattern-based GNN model, CI-Detector, for improved accuracy.
Findings
Achieved 81% precision and 97% recall in detecting cross-inlining pairs.
Constructed a comprehensive cross-inlining dataset with 216 configurations.
Identified three common cross-inlining patterns that challenge existing detection methods.
Abstract
Binary function similarity detection plays an important role in a wide range of security applications. Existing works usually assume that the query function and target function share equal semantics and compare their full semantics to obtain the similarity. However, we find that the function mapping is more complex, especially when function inlining happens. In this paper, we will systematically investigate cross-inlining binary function similarity detection. We first construct a cross-inlining dataset by compiling 51 projects using 9 compilers, with 4 optimizations, to 6 architectures, with 2 inlining flags, which results in two datasets both with 216 combinations. Then we construct the cross-inlining function mappings by linking the common source functions in these two datasets. Through analysis of this dataset, we find that three cross-inlining patterns widely exist while existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Software Engineering Research
