Retrofit: Continual Learning with Controlled Forgetting for Binary Security Detection and Analysis
Yiling He, Junchi Lei, Hongyu She, Shuo Shao, Xinran Zheng, Yiping Liu, Zhan Qin, Lorenzo Cavallaro

TL;DR
RETROFIT introduces a continual learning method for binary security analysis that controls forgetting without needing historical data, improving malware detection and binary summarization performance.
Contribution
It proposes a novel knowledge consolidation approach with controlled forgetting via parameter merging and confidence-guided arbitration, enhancing continual learning in security tasks.
Findings
Significantly improves malware detection retention scores from 20.2% to 38.6%.
Over 2x BLEU score improvement in binary summarization over transfer learning.
Outperforms all baselines in cross-representation generalization.
Abstract
Binary security has increasingly relied on deep learning to reason about malware behavior and program semantics. However, the performance often degrades as threat landscapes evolve and code representations shift. While continual learning (CL) offers a natural solution through sequential updates, most existing approaches rely on data replay or unconstrained updates, limiting their applicability and effectiveness in data-sensitive security environments. We propose RETROFIT, which regulates knowledge retention and adaptation with controlled forgetting at each update, without requiring historical data. Our key idea is to consolidate previously trained and newly fine-tuned models, serving as teachers of legacy and emergent knowledge, through retrospective-free parameter merging. Forgetting control is achieved by 1) constraining parameter changes to low-rank and sparse subspaces for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
