Zeroth-Order Adaptive Neuron Alignment Based Pruning without Re-Training

Elia Cunegatti; Leonardo Lucio Custode; Giovanni Iacca

arXiv:2411.07066·cs.LG·November 11, 2025

Zeroth-Order Adaptive Neuron Alignment Based Pruning without Re-Training

Elia Cunegatti, Leonardo Lucio Custode, Giovanni Iacca

PDF

Open Access 1 Repo

TL;DR

NeuroAl is a novel pruning method that enhances sparse large language models by maximizing neuron activation alignment without requiring re-training, outperforming existing methods across various tasks and sparsity levels.

Contribution

The paper introduces NeuroAl, a re-training-free, adaptive neuron alignment-based pruning algorithm that improves sparse LLM performance by leveraging activation information.

Findings

01

Consistently outperforms state-of-the-art pruning methods.

02

Effective across multiple LLM families and tasks.

03

Requires no re-training, saving computational resources.

Abstract

Network pruning focuses on algorithms that aim to reduce a given model's computational cost by removing a subset of its parameters while having minimal impact on performance. Throughout the last decade, the most widely used pruning paradigm has been pruning and re-training, which nowadays is inconvenient due to the vast amount of pre-trained models, which are, in any case, too expensive to re-train. In this paper, we exploit functional information from dense pre-trained models, i.e., their input activations, to obtain sparse models that maximize the activations' alignment with respect to their corresponding dense models. Hence, we propose \textbf{NeuroAl}, a \emph{top-up} algorithm that can be used on top of any given pruning algorithm for LLMs, which modifies the block-wise and row-wise sparsity, exploiting information from both the dense model and its sparse version to maximize the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eliacunegatti/neuroal
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Metaheuristic Optimization Algorithms Research

MethodsPruning · Sparse Evolutionary Training