Switchable Activation Networks
Laha Ale, Ning Zhang, Scott A. King, Pingzhi Fan

TL;DR
SWAN introduces a neural network framework with input-dependent binary gates at each unit, enabling adaptive computation that reduces redundancy and maintains accuracy, thus improving efficiency for resource-constrained deployment.
Contribution
It proposes a novel dynamic activation control mechanism that unifies sparsity, pruning, and adaptive inference in a single, learnable framework.
Findings
Reduces computational redundancy while preserving accuracy.
Supports both efficient inference and compact model conversion.
Unifies multiple efficiency techniques into a single paradigm.
Abstract
Deep neural networks, and more recently large-scale generative models such as large language models (LLMs) and large vision-action models (LVAs), achieve remarkable performance across diverse domains, yet their prohibitive computational cost hinders deployment in resource-constrained environments. Existing efficiency techniques offer only partial remedies: dropout improves regularization during training but leaves inference unchanged, while pruning and low-rank factorization compress models post hoc into static forms with limited adaptability. Here we introduce SWAN (Switchable Activation Networks), a framework that equips each neural unit with a deterministic, input-dependent binary gate, enabling the network to learn when a unit should be active or inactive. This dynamic control mechanism allocates computation adaptively, reducing redundancy while preserving accuracy. Unlike…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
