Effective Layer Pruning Through Similarity Metric Perspective
Ian Pons, Bruno Yamamoto, Anna H. Reali Costa, Artur Jordao

TL;DR
This paper proposes a novel layer pruning method using the Centered Kernel Alignment metric to measure layer importance, achieving high compression with minimal accuracy loss and improved robustness.
Contribution
It introduces a layer pruning strategy based on CKA similarity that outperforms existing methods in efficiency and accuracy preservation.
Findings
Removes over 75% of computation while improving accuracy.
Maintains negligible accuracy drop at high compression levels.
Pruned models show increased robustness to adversarial and out-of-distribution samples.
Abstract
Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the field. Extensive research demonstrated that pruning structures from these models is a straightforward approach to reducing network complexity. In this direction, most efforts focus on removing weights or filters. Studies have also been devoted to layer pruning as it promotes superior computational gains. However, layer pruning often hurts the network predictive ability (i.e., accuracy) at high compression rates. This work introduces an effective layer-pruning strategy that meets all underlying properties pursued by pruning methods. Our method estimates the relative importance of a layer using the Centered Kernel Alignment (CKA) metric, employed to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsManufacturing Process and Optimization
MethodsFocus · Pruning
