MCA: Moment Channel Attention Networks
Yangbo Jiang, Zhiwei Jiang, Le Han, Zenan Huang, Nenggan Zheng

TL;DR
This paper introduces the MCA framework, leveraging high-order statistical moments and a novel EMA mechanism to improve channel attention, leading to state-of-the-art results across multiple vision tasks with minimal computational overhead.
Contribution
The paper proposes a new Moment Channel Attention (MCA) method using extensive moment aggregation and cross moment convolution to enhance model capacity efficiently.
Findings
Achieves state-of-the-art performance on image classification, detection, and segmentation.
Outperforms existing channel attention methods with minimal additional computation.
Demonstrates the effectiveness of high-order moments in neural network attention mechanisms.
Abstract
Channel attention mechanisms endeavor to recalibrate channel weights to enhance representation abilities of networks. However, mainstream methods often rely solely on global average pooling as the feature squeezer, which significantly limits the overall potential of models. In this paper, we investigate the statistical moments of feature maps within a neural network. Our findings highlight the critical role of high-order moments in enhancing model capacity. Consequently, we introduce a flexible and comprehensive mechanism termed Extensive Moment Aggregation (EMA) to capture the global spatial context. Building upon this mechanism, we propose the Moment Channel Attention (MCA) framework, which efficiently incorporates multiple levels of moment-based information while minimizing additional computation costs through our Cross Moment Convolution (CMC) module. The CMC module via channel-wise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Neural Networks and Reservoir Computing · Anomaly Detection Techniques and Applications
MethodsGlobal Average Pooling · Average Pooling · Convolution
