Unleashing the Hidden Power of Compiler Optimization on Binary Code Difference: An Empirical Study
Xiaolei Ren, Michael Ho, Jiang Ming, Yu Lei, Li Li

TL;DR
This study systematically investigates how compiler optimization settings influence binary code differences, revealing that tailored optimization sequences can significantly undermine existing binary diffing tools and malware detection methods.
Contribution
The paper introduces BinTuner, a search-based tool that finds near-optimal compiler optimization sequences to maximize binary differences, exposing vulnerabilities in current binary comparison approaches.
Findings
BinTuner outperforms default optimization levels in creating binary differences.
Tailored optimization sequences can reduce malware detection rates by over 50%.
Existing binary diffing tools are vulnerable to non-default compiler optimizations.
Abstract
Since compiler optimization is the most common source contributing to binary code differences in syntax, testing the resilience against the changes caused by different compiler optimization settings has become a standard evaluation step for most binary diffing approaches. For example, 47 top-venue papers in the last 12 years compared different program versions compiled by default optimization levels (e.g., -Ox in GCC and LLVM). Although many of them claim they are immune to compiler transformations, it is yet unclear about their resistance to non-default optimization settings. Especially, we have observed that adversaries explored non-default compiler settings to amplify malware differences. This paper takes the first step to systematically studying the effectiveness of compiler optimization on binary code differences. We tailor search-based iterative compilation for the auto-tuning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Security and Verification in Computing · Software Engineering Research
