Disentangling the independently controllable factors of variation by   interacting with the world

Valentin Thomas; Emmanuel Bengio; William Fedus; Jules Pondard,; Philippe Beaudoin; Hugo Larochelle; Joelle Pineau; Doina Precup; Yoshua; Bengio

arXiv:1802.09484·stat.ML·February 27, 2018·46 cites

Disentangling the independently controllable factors of variation by interacting with the world

Valentin Thomas, Emmanuel Bengio, William Fedus, Jules Pondard,, Philippe Beaudoin, Hugo Larochelle, Joelle Pineau, Doina Precup, Yoshua, Bengio

PDF

Open Access

TL;DR

This paper proposes a method for discovering independently controllable factors of variation in an environment by interacting with it, aiming to improve representation learning without relying on static data or external rewards.

Contribution

It introduces a novel objective function that enables the discovery of controllable factors through interaction, advancing unsupervised disentanglement in dynamic environments.

Findings

01

Successfully disentangles controllable environmental factors

02

Operates without external reward signals

03

Works in interactive, dynamic settings

Abstract

It has been postulated that a good representation is one that disentangles the underlying explanatory factors of variation. However, it remains an open question what kind of training framework could potentially achieve that. Whereas most previous work focuses on the static setting (e.g., with images), we postulate that some of the causal factors could be discovered if the learner is allowed to interact with its environment. The agent can experiment with different actions and observe their effects. More specifically, we hypothesize that some of these factors correspond to aspects of the environment which are independently controllable, i.e., that there exists a policy and a learnable feature for each such aspect of the environment, such that this policy can yield changes in that feature with minimal changes to other features that explain the statistical variations in the observed data.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Generative Adversarial Networks and Image Synthesis · Machine Learning and Algorithms