Semantic Learning and Emulation Based Cross-platform Binary Vulnerability Seeker
Jian Gao, Yu Jiang, Zhe Liu, Xin Yang, Cong Wang, Xun Jiao, Zijiang, Yang, Jiaguang Sun

TL;DR
BinSeeker is a novel cross-platform binary vulnerability seeker that combines semantic learning and emulation to accurately identify vulnerable functions with high speed, outperforming existing tools in accuracy and efficiency.
Contribution
This paper introduces BinSeeker, a new approach integrating semantic learning and emulation for precise binary vulnerability detection across platforms.
Findings
Achieves higher MRR (0.65) than state-of-the-art tools.
Attains 93.33% top-5 accuracy in vulnerability ranking.
Operates with an average detection time of 0.27 seconds.
Abstract
Clone detection is widely exploited for software vulnerability search. The approaches based on source code analysis cannot be applied to binary clone detection because the same source code can produce significantly different binaries. In this paper, we present BinSeeker, a cross-platform binary seeker that integrates semantic learning and emulation. With the help of the labeled semantic flow graph, BinSeeker can quickly identify M candidate functions that are most similar to the vulnerability from the target binary. The value of M is relatively large so this semantic learning procedure essentially eliminates those functions that are very unlikely to have the vulnerability. Then, semantic emulation is conducted on these M candidates to obtain their dynamic signature sequences. By comparing signature sequences, BinSeeker produces top-N functions that exhibit most similar behavior to that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
