A Separation Principle for Control in the Age of Deep Learning

Alessandro Achille; Stefano Soatto

arXiv:1711.03321·stat.ML·November 10, 2017

A Separation Principle for Control in the Age of Deep Learning

Alessandro Achille, Stefano Soatto

PDF

TL;DR

This paper proposes a separation principle for control systems using deep learning to infer minimal, task-relevant state representations from complex data streams, enabling efficient and invariant control in high-dimensional environments.

Contribution

It introduces a novel framework combining the Information Bottleneck principle with deep neural networks to learn dynamic, minimal state representations for control without assuming Markovianity.

Findings

01

Representation can be inferred by minimizing the Information Bottleneck Lagrangian.

02

Deep neural networks can approximate the posterior density of the task variable.

03

The approach yields invariant, low-complexity representations suitable for control.

Abstract

We review the problem of defining and inferring a "state" for a control system based on complex, high-dimensional, highly uncertain measurement streams such as videos. Such a state, or representation, should contain all and only the information needed for control, and discount nuisance variability in the data. It should also have finite complexity, ideally modulated depending on available resources. This representation is what we want to store in memory in lieu of the data, as it "separates" the control task from the measurement process. For the trivial case with no dynamics, a representation can be inferred by minimizing the Information Bottleneck Lagrangian in a function class realized by deep neural networks. The resulting representation has much higher dimension than the data, already in the millions, but it is smaller in the sense of information content, retaining only what is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.