Momentum Auxiliary Network for Supervised Local Learning
Junhao Su, Changpeng Cai, Feiyu Zhu, Chenghao He, Xiaojie Xu, Dongzhi, Guan, Chenyang Si

TL;DR
The paper introduces the Momentum Auxiliary Network (MAN), a novel approach that enhances supervised local learning by enabling information transfer between local blocks through EMA-based auxiliary networks, leading to improved accuracy and significant memory savings.
Contribution
It proposes a dynamic interaction mechanism using EMA and learnable biases to improve local learning in neural networks, surpassing previous methods in accuracy and memory efficiency.
Findings
Achieves over 45% GPU memory reduction on ImageNet.
Outperforms traditional end-to-end training in accuracy on multiple datasets.
Validates effectiveness across four image classification datasets.
Abstract
Deep neural networks conventionally employ end-to-end backpropagation for their training process, which lacks biological credibility and triggers a locking dilemma during network parameter updates, leading to significant GPU memory use. Supervised local learning, which segments the network into multiple local blocks updated by independent auxiliary networks. However, these methods cannot replace end-to-end training due to lower accuracy, as gradients only propagate within their local block, creating a lack of information exchange between blocks. To address this issue and establish information transfer across blocks, we propose a Momentum Auxiliary Network (MAN) that establishes a dynamic interaction mechanism. The MAN leverages an exponential moving average (EMA) of the parameters from adjacent local blocks to enhance information flow. This auxiliary network, updated through EMA, helps…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
