NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models
Amit Dhurandhar, Tejaswini Pedapati, Ronny Luss, Soham Dan, Aurelie, Lozano, Payel Das, Georgios Kollias

TL;DR
NeuroPrune introduces a biologically inspired sparsity method for large language models, improving training efficiency and inference speed while maintaining competitive performance across NLP tasks.
Contribution
The paper presents a novel, model-agnostic sparsity approach inspired by brain network mechanisms, enhancing efficiency without sacrificing performance.
Findings
Up to 10x faster training times at similar sparsity levels
Competitive or superior performance compared to baselines
Measurable inference speed improvements in many cases
Abstract
Transformer-based Language Models have become ubiquitous in Natural Language Processing (NLP) due to their impressive performance on various tasks. However, expensive training as well as inference remains a significant impediment to their widespread applicability. While enforcing sparsity at various levels of the model architecture has found promise in addressing scaling and efficiency issues, there remains a disconnect between how sparsity affects network topology. Inspired by brain neuronal networks, we explore sparsity approaches through the lens of network topology. Specifically, we exploit mechanisms seen in biological networks, such as preferential attachment and redundant synapse pruning, and show that principled, model-agnostic sparsity approaches are performant and efficient across diverse NLP tasks, spanning both classification (such as natural language inference) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
