Structured Pattern Pruning Using Regularization

Dongjun Park; Geung-Hee Lee

arXiv:2109.08814·cs.CL·September 21, 2021

Structured Pattern Pruning Using Regularization

Dongjun Park, Geung-Hee Lee

PDF

Open Access

TL;DR

This paper introduces SPUR, a regularization-based pruning method that induces structured patterns in neural networks, preserving performance at high sparsity levels across various languages and tasks.

Contribution

SPUR is a novel regularization technique that enhances iterative magnitude pruning by preemptively creating structured patterns, improving efficiency and performance.

Findings

01

SPUR significantly preserves model performance at high sparsity levels.

02

Structured patterns in weight matrices can be effectively induced by regularization.

03

SPUR is resource-efficient and applicable across different languages and tasks.

Abstract

Iterative Magnitude Pruning (IMP) is a network pruning method that repeats the process of removing weights with the least magnitudes and retraining the model. When visualizing the weight matrices of language models pruned by IMP, previous research has shown that a structured pattern emerges, wherein the resulting surviving weights tend to prominently cluster in a select few rows and columns of the matrix. Though the need for further research in utilizing these structured patterns for potential performance gains has previously been indicated, it has yet to be thoroughly studied. We propose SPUR (Structured Pattern pruning Using Regularization), a novel pruning mechanism that preemptively induces structured patterns in compression by adding a regularization term to the objective function in the IMP. Our results show that SPUR can significantly preserve model performance under high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsPruning