Zeroth-Order Adaptive Neuron Alignment Based Pruning without Re-Training
Elia Cunegatti, Leonardo Lucio Custode, Giovanni Iacca

TL;DR
NeuroAl is a novel pruning method that enhances sparse large language models by maximizing neuron activation alignment without requiring re-training, outperforming existing methods across various tasks and sparsity levels.
Contribution
The paper introduces NeuroAl, a re-training-free, adaptive neuron alignment-based pruning algorithm that improves sparse LLM performance by leveraging activation information.
Findings
Consistently outperforms state-of-the-art pruning methods.
Effective across multiple LLM families and tasks.
Requires no re-training, saving computational resources.
Abstract
Network pruning focuses on algorithms that aim to reduce a given model's computational cost by removing a subset of its parameters while having minimal impact on performance. Throughout the last decade, the most widely used pruning paradigm has been pruning and re-training, which nowadays is inconvenient due to the vast amount of pre-trained models, which are, in any case, too expensive to re-train. In this paper, we exploit functional information from dense pre-trained models, i.e., their input activations, to obtain sparse models that maximize the activations' alignment with respect to their corresponding dense models. Hence, we propose \textbf{NeuroAl}, a \emph{top-up} algorithm that can be used on top of any given pruning algorithm for LLMs, which modifies the block-wise and row-wise sparsity, exploiting information from both the dense model and its sparse version to maximize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Metaheuristic Optimization Algorithms Research
MethodsPruning · Sparse Evolutionary Training
