EvoPatch-IoT: Evolution-Aware Cross-Architecture Vulnerability Retrieval and Patch-State Profiling for BusyBox-Based IoT Firmware
Yinhao Xiao, Huixi Li, and Yongluo Shen

TL;DR
EvoPatch-IoT is a novel evolution-aware framework that improves vulnerability retrieval and patch profiling in stripped BusyBox IoT firmware across multiple architectures, significantly reducing manual effort.
Contribution
It introduces a cross-architecture retrieval method combining instruction features, graph statistics, and historical data, along with a large benchmark dataset for IoT firmware analysis.
Findings
Achieves 34.56% Hit@1 and 56.24% Hit@10 in vulnerability retrieval.
Reduces manual inspection space by 98.98%.
Maintains high accuracy in patch-state classification across architectures.
Abstract
BusyBox is one of the most widely reused userland components in Linux-based Internet-of-Things (IoT) firmware, yet its security assessment remains difficult because firmware images are frequently stripped, vendor patch practices are inconsistent, and the same source component is compiled for heterogeneous architectures. We propose EvoPatch-IoT, an evolution-aware cross-architecture retrieval framework for stripped BusyBox firmware binaries. EvoPatch-IoT combines anonymous instruction/context features, graph-level statistics, per-binary geometric priors, and historical function prototypes to localize homologous and potentially vulnerable functions without relying on symbols, source paths, or version strings at test time. We further construct a large-scale BusyBox benchmark from 57 historical versions, 270 unstripped binaries, 285 stripped binaries, and 130 source releases, yielding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
