Alternating Gradient Flow Utility: A Unified Metric for Structural Pruning and Dynamic Routing in Deep Networks
Tianhao Qian, Zhuoxuan Li, Jinde Cao, Xinli Shi, Leszek Rutkowski

TL;DR
This paper introduces a new metric called Alternating Gradient Flow Utility for structural pruning and dynamic routing in deep networks, addressing limitations of traditional magnitude-based metrics and enabling more efficient, topologically aware network compression and routing.
Contribution
It proposes a decoupled kinetic paradigm based on AGF that captures network utility more accurately, revealing phase transitions and bottlenecks, and develops a hybrid routing framework validated on large-scale benchmarks.
Findings
AGF preserves functionality at extreme sparsity levels.
Identifies a sparsity bottleneck in Vision Transformers.
Achieves 50% expert reduction with maintained accuracy.
Abstract
Efficient deep learning traditionally relies on static heuristics like weight magnitude or activation awareness (e.g., Wanda, RIA). While successful in unstructured settings, we observe a critical limitation when applying these metrics to the structural pruning of deep vision networks. These contemporary metrics suffer from a magnitude bias, failing to preserve critical functional pathways. To overcome this, we propose a decoupled kinetic paradigm inspired by Alternating Gradient Flow (AGF), utilizing an absolute feature-space Taylor expansion to accurately capture the network's structural "kinetic utility". First, we uncover a topological phase transition at extreme sparsity, where AGF successfully preserves baseline functionality and exhibits topological implicit regularization, avoiding the collapse seen in models trained from scratch. Second, transitioning to architectures without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
