TL;DR
This paper introduces a modular compositional bias approach for disentangled representation learning that uses factor-specific remixing strategies, enabling flexible and joint disentanglement of attributes and objects without changing architectures or objectives.
Contribution
It proposes a novel modular inductive bias framework that decouples from objectives and architectures, allowing flexible disentanglement through remixing strategies based on data recombination rules.
Findings
Achieves competitive attribute and object disentanglement.
Successfully disentangles global style and objects jointly.
Demonstrates flexibility across different factors of variation.
Abstract
Recent disentangled representation learning (DRL) methods heavily rely on factor specific strategies-either learning objectives for attributes or model architectures for objects-to embed inductive biases. Such divergent approaches result in significant overhead when novel factors of variation do not align with prior assumptions, such as statistical independence or spatial exclusivity, or when multiple factors coexist, as practitioners must redesign architectures or objectives. To address this, we propose a compositional bias, a modular inductive bias decoupled from both objectives and architectures. Our key insight is that different factors obey distinct recombination rules in the data distribution: global attributes are mutually exclusive, e.g., a face has one nose, while objects share a common support (any subset of objects can co-exist). We therefore randomly remix latents according…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
