NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for   Large Language Models

Amit Dhurandhar; Tejaswini Pedapati; Ronny Luss; Soham Dan; Aurelie; Lozano; Payel Das; Georgios Kollias

arXiv:2404.01306·cs.LG·June 6, 2024·2 cites

NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models

Amit Dhurandhar, Tejaswini Pedapati, Ronny Luss, Soham Dan, Aurelie, Lozano, Payel Das, Georgios Kollias

PDF

Open Access 1 Video

TL;DR

NeuroPrune introduces a biologically inspired sparsity method for large language models, improving training efficiency and inference speed while maintaining competitive performance across NLP tasks.

Contribution

The paper presents a novel, model-agnostic sparsity approach inspired by brain network mechanisms, enhancing efficiency without sacrificing performance.

Findings

01

Up to 10x faster training times at similar sparsity levels

02

Competitive or superior performance compared to baselines

03

Measurable inference speed improvements in many cases

Abstract

Transformer-based Language Models have become ubiquitous in Natural Language Processing (NLP) due to their impressive performance on various tasks. However, expensive training as well as inference remains a significant impediment to their widespread applicability. While enforcing sparsity at various levels of the model architecture has found promise in addressing scaling and efficiency issues, there remains a disconnect between how sparsity affects network topology. Inspired by brain neuronal networks, we explore sparsity approaches through the lens of network topology. Specifically, we exploit mechanisms seen in biological networks, such as preferential attachment and redundant synapse pruning, and show that principled, model-agnostic sparsity approaches are performant and efficient across diverse NLP tasks, spanning both classification (such as natural language inference) and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling