Switchable Activation Networks

Laha Ale; Ning Zhang; Scott A. King; Pingzhi Fan

arXiv:2603.06601·cs.LG·March 10, 2026

Switchable Activation Networks

Laha Ale, Ning Zhang, Scott A. King, Pingzhi Fan

PDF

Open Access

TL;DR

SWAN introduces a neural network framework with input-dependent binary gates at each unit, enabling adaptive computation that reduces redundancy and maintains accuracy, thus improving efficiency for resource-constrained deployment.

Contribution

It proposes a novel dynamic activation control mechanism that unifies sparsity, pruning, and adaptive inference in a single, learnable framework.

Findings

01

Reduces computational redundancy while preserving accuracy.

02

Supports both efficient inference and compact model conversion.

03

Unifies multiple efficiency techniques into a single paradigm.

Abstract

Deep neural networks, and more recently large-scale generative models such as large language models (LLMs) and large vision-action models (LVAs), achieve remarkable performance across diverse domains, yet their prohibitive computational cost hinders deployment in resource-constrained environments. Existing efficiency techniques offer only partial remedies: dropout improves regularization during training but leaves inference unchanged, while pruning and low-rank factorization compress models post hoc into static forms with limited adaptability. Here we introduce SWAN (Switchable Activation Networks), a framework that equips each neural unit with a deterministic, input-dependent binary gate, enabling the network to learn when a unit should be active or inactive. This dynamic control mechanism allocates computation adaptively, reducing redundancy while preserving accuracy. Unlike…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning