Towards a Definition of Disentangled Representations
Irina Higgins, David Amos, David Pfau, Sebastien Racaniere, Loic, Matthey, Danilo Rezende, Alexander Lerchner

TL;DR
This paper proposes a formal, transformation-based definition of disentangled representations inspired by symmetry transformations in physics, aiming to clarify the concept and guide future learning algorithms.
Contribution
It introduces the first formal definition of disentangled representations using group and representation theory, resolving previous ambiguities and aligning with core intuitions.
Findings
Defines disentangling through invariance to certain transformations
Connects symmetry transformations to vector representations
Provides a principled framework for future research
Abstract
How can intelligent agents solve a diverse set of tasks in a data-efficient manner? The disentangled representation learning approach posits that such an agent would benefit from separating out (disentangling) the underlying structure of the world into disjoint parts of its representation. However, there is no generally agreed-upon definition of disentangling, not least because it is unclear how to formalise the notion of world structure beyond toy datasets with a known ground truth generative process. Here we propose that a principled solution to characterising disentangled representations can be found by focusing on the transformation properties of the world. In particular, we suggest that those transformations that change only some properties of the underlying world state, while leaving all other properties invariant, are what gives exploitable structure to any kind of data. Similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection
