Decoupled Networks
Weiyang Liu, Zhen Liu, Zhiding Yu, Bo Dai, Rongmei Lin, Yisen Wang,, James M. Rehg, Le Song

TL;DR
This paper introduces a decoupled learning framework for CNNs that models intra-class variation and semantic difference separately, leading to improved performance, convergence, and robustness.
Contribution
It proposes a novel decoupled convolution operator and a reparameterization method that enhances CNN learning by explicitly modeling feature components.
Findings
Significant performance improvements over standard CNNs.
Faster convergence and increased robustness.
Effective decoupled operators with geometric interpretation.
Abstract
Inner product-based convolution has been a central component of convolutional neural networks (CNNs) and the key to learning visual representations. Inspired by the observation that CNN-learned features are naturally decoupled with the norm of features corresponding to the intra-class variation and the angle corresponding to the semantic difference, we propose a generic decoupled learning framework which models the intra-class variation and semantic difference independently. Specifically, we first reparametrize the inner product to a decoupled form and then generalize it to the decoupled convolution operator which serves as the building block of our decoupled networks. We present several effective instances of the decoupled convolution operator. Each decoupled operator is well motivated and has an intuitive geometric interpretation. Based on these decoupled operators, we further propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance
MethodsConvolution
