Decoupled Networks

Weiyang Liu; Zhen Liu; Zhiding Yu; Bo Dai; Rongmei Lin; Yisen Wang,; James M. Rehg; Le Song

arXiv:1804.08071·cs.CV·April 24, 2018

Decoupled Networks

Weiyang Liu, Zhen Liu, Zhiding Yu, Bo Dai, Rongmei Lin, Yisen Wang,, James M. Rehg, Le Song

PDF

Open Access 1 Repo

TL;DR

This paper introduces a decoupled learning framework for CNNs that models intra-class variation and semantic difference separately, leading to improved performance, convergence, and robustness.

Contribution

It proposes a novel decoupled convolution operator and a reparameterization method that enhances CNN learning by explicitly modeling feature components.

Findings

01

Significant performance improvements over standard CNNs.

02

Faster convergence and increased robustness.

03

Effective decoupled operators with geometric interpretation.

Abstract

Inner product-based convolution has been a central component of convolutional neural networks (CNNs) and the key to learning visual representations. Inspired by the observation that CNN-learned features are naturally decoupled with the norm of features corresponding to the intra-class variation and the angle corresponding to the semantic difference, we propose a generic decoupled learning framework which models the intra-class variation and semantic difference independently. Specifically, we first reparametrize the inner product to a decoupled form and then generalize it to the decoupled convolution operator which serves as the building block of our decoupled networks. We present several effective instances of the decoupled convolution operator. Each decoupled operator is well motivated and has an intuitive geometric interpretation. Based on these decoupled operators, we further propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yujiacheng333/BaseDcLayer
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed systems and fault tolerance

MethodsConvolution