Channel Equilibrium Networks for Learning Deep Representation
Wenqi Shao, Shitao Tang, Xingang Pan, Ping Tan, Xiaogang Wang, Ping, Luo

TL;DR
This paper introduces Channel Equilibrium (CE) blocks that address channel inhibition in CNNs, leading to improved generalization and state-of-the-art results on benchmarks like ImageNet and COCO.
Contribution
The paper proposes a novel CE block that 'wakes up' inhibited channels, enhancing CNN performance and generalization, and connects it with Nash Equilibrium theory.
Findings
CE blocks improve CNN accuracy on ImageNet and COCO
CE can be integrated into architectures like ResNet and MobileNet
CE achieves state-of-the-art performance on multiple benchmarks
Abstract
Convolutional Neural Networks (CNNs) are typically constructed by stacking multiple building blocks, each of which contains a normalization layer such as batch normalization (BN) and a rectified linear function such as ReLU. However, this work shows that the combination of normalization and rectified linear function leads to inhibited channels, which have small magnitude and contribute little to the learned feature representation, impeding the generalization ability of CNNs. Unlike prior arts that simply removed the inhibited channels, we propose to "wake them up" during training by designing a novel neural building block, termed Channel Equilibrium (CE) block, which enables channels at the same layer to contribute equally to the learned representation. We show that CE is able to prevent inhibited channels both empirically and theoretically. CE has several appealing benefits. (1) It can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection · Convolution
