MI-to-Mid Distilled Compression (M2M-DC): An Hybrid-Information-Guided-Block Pruning with Progressive Inner Slicing Approach to Model Compression

Lionel Levine; Haniyeh Ehsani Oskouie; Sajjad Ghiasvand; Majid Sarrafzadeh

arXiv:2511.06842·cs.LG·November 19, 2025

MI-to-Mid Distilled Compression (M2M-DC): An Hybrid-Information-Guided-Block Pruning with Progressive Inner Slicing Approach to Model Compression

Lionel Levine, Haniyeh Ehsani Oskouie, Sajjad Ghiasvand, Majid Sarrafzadeh

PDF

Open Access

TL;DR

M2M-DC is a novel model compression framework that combines information-guided block pruning with progressive inner slicing and staged knowledge distillation, achieving high accuracy with significantly reduced computational cost.

Contribution

The paper introduces a new two-scale, shape-safe compression method that effectively prunes and slices residual CNNs, extending to inverted-residual architectures with minimal modifications.

Findings

01

ResNet-18 achieves 85.46% Top-1 accuracy with 72% parameters and 63% GMacs.

02

ResNet-34 reaches 85.02% Top-1 accuracy with 74% parameters and GMacs.

03

MobileNetV2 improves to 68.54% Top-1 accuracy at 27% parameters, surpassing the teacher.

Abstract

We introduce MI-to-Mid Distilled Compression (M2M-DC), a two-scale, shape-safe compression framework that interleaves information-guided block pruning with progressive inner slicing and staged knowledge distillation (KD). First, M2M-DC ranks residual (or inverted-residual) blocks by a label-aware mutual information (MI) signal and removes the least informative units (structured prune-after-training). It then alternates short KD phases with stage-coherent, residual-safe channel slicing: (i) stage "planes" (co-slicing conv2 out-channels with the downsample path and next-stage inputs), and (ii) an optional mid-channel trim (conv1 out / bn1 / conv2 in). This targets complementary redundancy, whole computational motifs and within-stage width while preserving residual shape invariants. On CIFAR-100, M2M-DC yields a clean accuracy-compute frontier. For ResNet-18, we obtain 85.46% Top-1 with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Wireless Signal Modulation Classification · Adversarial Robustness in Machine Learning