Maximally Informative Hierarchical Representations of High-Dimensional Data

Greg Ver Steeg; Aram Galstyan

arXiv:1410.7404 · stat.ML · February 3, 2015 · 27 cites

TL;DR

This paper introduces a principled, efficient method for constructing hierarchical representations that are maximally informative about the data, enabling unsupervised learning of deep representations with linear computational complexity and constant sample complexity in the number of variables.

Contribution

The paper derives bounds on how informative a representation is about its input data, extends them to hierarchical representations, and proposes a simple, bottom-up optimization procedure for learning maximally informative deep representations.

Findings

01

The bounds extend to hierarchical representations, quantifying how much each layer contributes to capturing the information in the original data.

02

The optimization has linear computational complexity and constant sample complexity in the number of variables.

03

The approach is demonstrated on both synthetic and real-world data.

Abstract

We consider a set of probabilistic functions of some input variables as a representation of the inputs. We present bounds on how informative a representation is about input data. We extend these bounds to hierarchical representations so that we can quantify the contribution of each layer towards capturing the information in the original data. The special form of these bounds leads to a simple, bottom-up optimization procedure to construct hierarchical representations that are also maximally informative about the data. This optimization has linear computational complexity and constant sample complexity in the number of variables. These results establish a new approach to unsupervised learning of deep representations that is both principled and practical. We demonstrate the usefulness of the approach on both synthetic and real-world data.
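
To make "how informative a representation is" concrete, here is a minimal sketch. The abstract does not name the information measure; in the paper it is total correlation, TC(X) = sum_i H(X_i) - H(X), and a representation Y explains the amount TC(X) - TC(X|Y), which is bounded above by TC(X). The function names and the toy data below are illustrative assumptions, not the authors' code, and the sketch only evaluates the quantity for a given Y; it does not implement the paper's optimization procedure.

```python
import math
from collections import Counter


def entropy(samples):
    """Empirical Shannon entropy (in bits) of a list of hashable outcomes."""
    counts = Counter(samples)
    n = len(samples)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def total_correlation(rows):
    """TC(X) = sum_i H(X_i) - H(X), with each row a tuple of discrete values."""
    columns = list(zip(*rows))
    return sum(entropy(col) for col in columns) - entropy(rows)


def conditional_total_correlation(rows, labels):
    """TC(X|Y) = sum_y p(y) * TC(X | Y=y), for a discrete representation Y."""
    n = len(rows)
    groups = {}
    for row, y in zip(rows, labels):
        groups.setdefault(y, []).append(row)
    return sum(len(g) / n * total_correlation(g) for g in groups.values())


# Toy data (hypothetical): three binary variables that all copy one hidden bit.
rows = [(y, y, y) for y in (0, 1) for _ in range(4)]
labels = [row[0] for row in rows]  # representation Y recovers the hidden bit

tc = total_correlation(rows)
tc_given_y = conditional_total_correlation(rows, labels)
explained = tc - tc_given_y  # dependence in X accounted for by Y

print(f"TC(X) = {tc:.3f} bits, TC(X|Y) = {tc_given_y:.3f}, explained = {explained:.3f}")
```

In this toy case the variables are independent once Y is known, so TC(X|Y) is zero and Y attains the upper bound TC(X). A less informative Y would leave some residual dependence for a further layer to explain, which is the intuition behind quantifying each layer's contribution.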

Taxonomy

Topics: Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning