MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks

Junhao Su; Feiyu Zhu; Hengyu Shi; Tianyang Han; Yurui Qiu; Junfeng Luo; Xiaoming Wei; Jialin Gao

arXiv:2507.16279·cs.CV·March 31, 2026

MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks

Junhao Su, Feiyu Zhu, Hengyu Shi, Tianyang Han, Yurui Qiu, Junfeng Luo, Xiaoming Wei, Jialin Gao

PDF

TL;DR

MAN++ introduces a novel auxiliary network with EMA-based inter-block communication and learnable bias, enabling efficient supervised local learning with reduced memory usage across vision tasks.

Contribution

The paper proposes MAN++, a new auxiliary network that enhances local learning by using EMA of parameters and learnable bias, improving performance and efficiency.

Findings

01

MAN++ achieves comparable performance to end-to-end training.

02

It significantly reduces GPU memory consumption.

03

Effective across multiple vision tasks and architectures.

Abstract

Deep learning typically relies on end-to-end backpropagation for training, a method that inherently suffers from issues such as update locking during parameter optimization, high GPU memory consumption, and a lack of biological plausibility. In contrast, supervised local learning seeks to mitigate these challenges by partitioning the network into multiple local blocks and designing independent auxiliary networks to update each block separately. However, because gradients are propagated solely within individual local blocks, performance degradation occurs, preventing supervised local learning from supplanting end-to-end backpropagation. To address these limitations and facilitate inter-block information flow, we propose the Momentum Auxiliary Network++ (MAN++). MAN++ introduces a dynamic interaction mechanism by employing the Exponential Moving Average (EMA) of parameters from adjacent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.