Mitigating Negative Flips via Margin Preserving Training
Simone Ricci, Niccol\`o Biondi, Federico Pernici, Alberto Del Bimbo

TL;DR
This paper introduces a margin-preserving training method that reduces negative flips in image classification models when updating with new classes, maintaining consistency and accuracy across model versions.
Contribution
It proposes a novel margin calibration approach combined with focal distillation to mitigate negative flips while preserving accuracy on both old and new classes.
Findings
Significantly reduces negative flip rate in experiments
Maintains high overall accuracy on benchmark datasets
Effective across various class-incremental scenarios
Abstract
Minimizing inconsistencies across successive versions of an AI system is as crucial as reducing the overall error. In image classification, such inconsistencies manifest as negative flips, where an updated model misclassifies test samples that were previously classified correctly. This issue becomes increasingly pronounced as the number of training classes grows over time, since adding new categories reduces the margin of each class and may introduce conflicting patterns that undermine their learning process, thereby degrading performance on the original subset. To mitigate negative flips, we propose a novel approach that preserves the margins of the original model while learning an improved one. Our method encourages a larger relative margin between the previously learned and newly introduced classes by introducing an explicit margin-calibration term on the logits. However, overly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
