ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training
Yue Zhao, Yantao Shen, Yuanjun Xiong, Shuo Yang, Wei Xia, Zhuowen Tu,, Bernt Schiele, Stefano Soatto

TL;DR
ELODI is a novel training method that reduces negative flips in model updates by distilling ensemble logits into a single model, maintaining accuracy and lowering error rates without increasing inference costs.
Contribution
The paper introduces ELODI, a new ensemble distillation technique that effectively reduces negative flips while preserving accuracy, using a generalized logit difference inhibition objective.
Findings
ELODI achieves lower negative flip rates compared to existing methods.
The method maintains high accuracy during model updates.
ELODI reduces inference costs by using a single model after distillation.
Abstract
Negative flips are errors introduced in a classification system when a legacy model is updated. Existing methods to reduce the negative flip rate (NFR) either do so at the expense of overall accuracy by forcing a new model to imitate the old models, or use ensembles, which multiply inference cost prohibitively. We analyze the role of ensembles in reducing NFR and observe that they remove negative flips that are typically not close to the decision boundary, but often exhibit large deviations in the distance among their logits. Based on the observation, we present a method, called Ensemble Logit Difference Inhibition (ELODI), to train a classification system that achieves paragon performance in both error rate and NFR, at the inference cost of a single model. The method distills a homogeneous ensemble to a single student model which is used to update the classification system. ELODI also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Machine Learning and Algorithms
MethodsFLIP
