Robust Model Compression Using Deep Hypotheses
Omri Armstrong, Ran Gilad-Bachrach

TL;DR
This paper introduces a new robust model compression method using deep hypotheses, capable of compressing various model types into small, interpretable models with enhanced robustness and accuracy.
Contribution
The paper presents the MEMO algorithm for multi-class deep hypothesis finding and the CREMBO method for robust model compression, extending depth-based approaches beyond binary classification.
Findings
CREMBO produces more accurate, robust compressed models.
The method outperforms Knowledge Distillation in DNN to DNN compression.
Effective across neural networks, ensembles, and other models.
Abstract
Machine Learning models should ideally be compact and robust. Compactness provides efficiency and comprehensibility whereas robustness provides resilience. Both topics have been studied in recent years but in isolation. Here we present a robust model compression scheme which is independent of model types: it can compress ensembles, neural networks and other types of models into diverse types of small models. The main building block is the notion of depth derived from robust statistics. Originally, depth was introduced as a measure of the centrality of a point in a sample such that the median is the deepest point. This concept was extended to classification functions which makes it possible to define the depth of a hypothesis and the median hypothesis. Algorithms have been suggested to approximate the median but they have been limited to binary classification. In this study, we present a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Neural Networks and Applications · Anomaly Detection Techniques and Applications
MethodsKnowledge Distillation
