StriderSPD: Structure-Guided Joint Representation Learning for Binary Security Patch Detection
Qingyuan Li, Chenchen Yu, Chuanyi Li, Xin-Cheng Wen, Cheryl Lee, Cuiyun Gao, and Bin Luo

TL;DR
StriderSPD introduces a structure-guided joint representation learning framework for binary security patch detection, effectively leveraging structural information and a novel training strategy to improve detection accuracy on realistic, disjoint datasets.
Contribution
The paper proposes a novel structure-guided joint representation learning framework for binary security patch detection, integrating a graph branch with an LLM and a two-stage training strategy for improved performance.
Findings
Effective detection of security patches in binary code.
Superior performance on a newly constructed, disjoint benchmark.
Enhanced alignment of assembly and pseudo-code representations.
Abstract
Vulnerabilities severely threaten software systems, making the timely application of security patches crucial for mitigating attacks. However, software vendors often silently patch vulnerabilities with limited disclosure, where Security Patch Detection (SPD) comes to protect software assets. Recently, most SPD studies have targeted Open-Source Software (OSS), yet a large portion of real-world software is closed-source, where patches are distributed as binaries without accessible source code. The limited binary SPD approaches often lift binaries to abstraction levels, i.e., assembly code or pseudo-code. However, assembly code is register-based instructions conveying limited semantics, while pseudo-code lacks parser-compatible grammar to extract structure, both hindering accurate vulnerability-fix representation learning. In addition, previous studies often obtain training and testing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Information and Cyber Security · Advanced Malware Detection Techniques
