DEFT: Distilling Entangled Factors by Preventing Information Diffusion

Jiantao Wu; Lin Wang; Bo Yang; Fanqi Li; Chunxiuzi Liu; Jin Zhou

arXiv:2102.03986·cs.LG·April 26, 2022

DEFT: Distilling Entangled Factors by Preventing Information Diffusion

Jiantao Wu, Lin Wang, Bo Yang, Fanqi Li, Chunxiuzi Liu, Jin Zhou

PDF

TL;DR

DEFT introduces a novel framework to improve disentanglement in representation learning by addressing the information diffusion problem through staged training and information scaling, achieving high-quality disentangled factors.

Contribution

The paper proposes DEFT, a new disentanglement method that prevents information diffusion by scaling backward information, with a multistage training strategy for better factor separation.

Findings

01

DEFT outperforms existing methods on dSprite and SmallNORB datasets.

02

DEFT achieves low-variance and high-level disentanglement scores.

03

DEFT is effective in unsupervised disentanglement scenarios.

Abstract

Disentanglement is a highly desirable property of representation owing to its similarity to human understanding and reasoning. Many works achieve disentanglement upon information bottlenecks (IB). Despite their elegant mathematical foundations, the IB branch usually exhibits lower performance. In order to provide an insight into the problem, we develop an annealing test to calculate the information freezing point (IFP), which is a transition state to freeze information into the latent variables. We also explore these clues or inductive biases for separating the entangled factors according to the differences in the IFP distributions. We found the existing approaches suffer from the information diffusion problem, according to which the increased information diffuses in all latent variables. Based on this insight, we propose a novel disentanglement framework, termed the distilling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDiffusion