ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent   Training

Yue Zhao; Yantao Shen; Yuanjun Xiong; Shuo Yang; Wei Xia; Zhuowen Tu,; Bernt Schiele; Stefano Soatto

arXiv:2205.06265·cs.LG·April 23, 2024·1 cites

ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training

Yue Zhao, Yantao Shen, Yuanjun Xiong, Shuo Yang, Wei Xia, Zhuowen Tu,, Bernt Schiele, Stefano Soatto

PDF

Open Access 1 Repo

TL;DR

ELODI is a novel training method that reduces negative flips in model updates by distilling ensemble logits into a single model, maintaining accuracy and lowering error rates without increasing inference costs.

Contribution

The paper introduces ELODI, a new ensemble distillation technique that effectively reduces negative flips while preserving accuracy, using a generalized logit difference inhibition objective.

Findings

01

ELODI achieves lower negative flip rates compared to existing methods.

02

The method maintains high accuracy during model updates.

03

ELODI reduces inference costs by using a single model after distillation.

Abstract

Negative flips are errors introduced in a classification system when a legacy model is updated. Existing methods to reduce the negative flip rate (NFR) either do so at the expense of overall accuracy by forcing a new model to imitate the old models, or use ensembles, which multiply inference cost prohibitively. We analyze the role of ensembles in reducing NFR and observe that they remove negative flips that are typically not close to the decision boundary, but often exhibit large deviations in the distance among their logits. Based on the observation, we present a method, called Ensemble Logit Difference Inhibition (ELODI), to train a classification system that achieves paragon performance in both error rate and NFR, at the inference cost of a single model. The method distills a homogeneous ensemble to a single student model which is used to update the classification system. ELODI also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amazon-science/regression-constraint-model-upgrade
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Machine Learning and Algorithms

MethodsFLIP