MI-to-Mid Distilled Compression (M2M-DC): An Hybrid-Information-Guided-Block Pruning with Progressive Inner Slicing Approach to Model Compression
Lionel Levine, Haniyeh Ehsani Oskouie, Sajjad Ghiasvand, Majid Sarrafzadeh

TL;DR
M2M-DC is a novel model compression framework that combines information-guided block pruning with progressive inner slicing and staged knowledge distillation, achieving high accuracy with significantly reduced computational cost.
Contribution
The paper introduces a new two-scale, shape-safe compression method that effectively prunes and slices residual CNNs, extending to inverted-residual architectures with minimal modifications.
Findings
ResNet-18 achieves 85.46% Top-1 accuracy with 72% parameters and 63% GMacs.
ResNet-34 reaches 85.02% Top-1 accuracy with 74% parameters and GMacs.
MobileNetV2 improves to 68.54% Top-1 accuracy at 27% parameters, surpassing the teacher.
Abstract
We introduce MI-to-Mid Distilled Compression (M2M-DC), a two-scale, shape-safe compression framework that interleaves information-guided block pruning with progressive inner slicing and staged knowledge distillation (KD). First, M2M-DC ranks residual (or inverted-residual) blocks by a label-aware mutual information (MI) signal and removes the least informative units (structured prune-after-training). It then alternates short KD phases with stage-coherent, residual-safe channel slicing: (i) stage "planes" (co-slicing conv2 out-channels with the downsample path and next-stage inputs), and (ii) an optional mid-channel trim (conv1 out / bn1 / conv2 in). This targets complementary redundancy, whole computational motifs and within-stage width while preserving residual shape invariants. On CIFAR-100, M2M-DC yields a clean accuracy-compute frontier. For ResNet-18, we obtain 85.46% Top-1 with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Wireless Signal Modulation Classification · Adversarial Robustness in Machine Learning
