Cornucopia: A Framework for Feedback Guided Generation of Binaries
Vidush Singhal, Akul Abhilash Pillai, Charitha Saumya, Milind, Kulkarni, Aravind Machiry

TL;DR
Cornucopia is an architecture-agnostic framework that generates diverse binaries from source code, enabling extensive testing of binary analysis tools and revealing their limitations through large-scale, feedback-guided binary creation.
Contribution
We introduce Cornucopia, a novel automated framework for generating large, diverse binary datasets from source code, improving testing and evaluation of binary analysis tools.
Findings
Generated 309K binaries across four architectures.
Revealed bugs and crashes in popular analysis tools.
Showed poor performance of machine learning tools on generated binaries.
Abstract
Binary analysis is an important capability required for many security and software engineering applications. Consequently, there are many binary analysis techniques and tools with varied capabilities. However, testing these tools requires a large, varied binary dataset with corresponding source-level information. In this paper, we present Cornucopia, an architecture agnostic automated framework that can generate a plethora of binaries from corresponding program source by exploiting compiler optimizations and feedback-guided learning. Our evaluation shows that Cornucopia was able to generate 309K binaries across four architectures (x86, x64, ARM, MIPS) with an average of 403 binaries for each program and outperforms Bintuner, a similar technique. Our experiments revealed issues with the LLVM optimization scheduler resulting in compiler crashes (300). Our evaluation of four popular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Software Reliability and Analysis Research · Software Testing and Debugging Techniques
