Retrofit: Continual Learning with Controlled Forgetting for Binary Security Detection and Analysis

Yiling He; Junchi Lei; Hongyu She; Shuo Shao; Xinran Zheng; Yiping Liu; Zhan Qin; Lorenzo Cavallaro

arXiv:2511.11439·cs.LG·April 24, 2026

Retrofit: Continual Learning with Controlled Forgetting for Binary Security Detection and Analysis

Yiling He, Junchi Lei, Hongyu She, Shuo Shao, Xinran Zheng, Yiping Liu, Zhan Qin, Lorenzo Cavallaro

PDF

TL;DR

RETROFIT introduces a continual learning method for binary security analysis that controls forgetting without needing historical data, improving malware detection and binary summarization performance.

Contribution

It proposes a novel knowledge consolidation approach with controlled forgetting via parameter merging and confidence-guided arbitration, enhancing continual learning in security tasks.

Findings

01

Significantly improves malware detection retention scores from 20.2% to 38.6%.

02

Over 2x BLEU score improvement in binary summarization over transfer learning.

03

Outperforms all baselines in cross-representation generalization.

Abstract

Binary security has increasingly relied on deep learning to reason about malware behavior and program semantics. However, the performance often degrades as threat landscapes evolve and code representations shift. While continual learning (CL) offers a natural solution through sequential updates, most existing approaches rely on data replay or unconstrained updates, limiting their applicability and effectiveness in data-sensitive security environments. We propose RETROFIT, which regulates knowledge retention and adaptation with controlled forgetting at each update, without requiring historical data. Our key idea is to consolidate previously trained and newly fine-tuned models, serving as teachers of legacy and emergent knowledge, through retrospective-free parameter merging. Forgetting control is achieved by 1) constraining parameter changes to low-rank and sparse subspaces for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.