Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune   CNNs and Transformers

Sayed Mohammad Vakilzadeh Hatefi; Maximilian Dreyer; Reduan Achtibat,; Thomas Wiegand; Wojciech Samek; Sebastian Lapuschkin

arXiv:2408.12568·cs.AI·October 24, 2024

Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers

Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Reduan Achtibat,, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to optimize attribution-based pruning hyperparameters, improving the compression of CNNs and Transformers while maintaining high accuracy on ImageNet.

Contribution

It proposes explicitly optimizing attribution method hyperparameters for pruning, extending analysis to transformer architectures, and achieving higher compression rates.

Findings

01

Transformers are more over-parameterized than CNNs.

02

Optimized attribution methods lead to better pruning results.

03

High model compression with maintained performance.

Abstract

To solve ever more complex problems, Deep Neural Networks are scaled to billions of parameters, leading to huge computational costs. An effective approach to reduce computational requirements and increase efficiency is to prune unnecessary components of these often over-parameterized networks. Previous work has shown that attribution methods from the field of eXplainable AI serve as effective means to extract and prune the least relevant network components in a few-shot fashion. We extend the current state by proposing to explicitly optimize hyperparameters of attribution methods for the task of pruning, and further include transformer-based networks in our analysis. Our approach yields higher model compression rates of large transformer- and convolutional architectures (VGG, ResNet, ViT) compared to previous works, while still attaining high performance on ImageNet classification…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

erfanhatefi/pruning-by-explaining-in-pytorch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications

MethodsAverage Pooling · Global Average Pooling · Kaiming Initialization · Convolution · Max Pooling