Learning Relevant Features of Data with Multi-scale Tensor Networks

E.M. Stoudenmire

arXiv:1801.00315·stat.ML·May 1, 2018

TL;DR

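This paper introduces layered tree tensor networks, inspired by coarse-graining methods in physics, for extracting features from data. Most layers are computed with an unsupervised algorithm and only the top layer is optimized for supervised classification, which gives very good results on the MNIST and fashion-MNIST image data sets.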

Contribution

It adapts coarse-graining tensor network algorithms from physics to data, computing most layers of a tree tensor network with an unsupervised algorithm and optimizing only the top layer for supervised classification.

Findings

01

Cost scales linearly with both the input dimension and the training set size

02

Very good classification results on MNIST and fashion-MNIST with only the top layer trained with supervision

03

Mixing a prior guess for the supervised weights with the unsupervised representation yields fewer features that still perform well

Abstract

Inspired by coarse-graining approaches used in physics, we show how similar algorithms can be adapted for data. The resulting algorithms are based on layered tree tensor networks and scale linearly with both the dimension of the input and the training set size. Computing most of the layers with an unsupervised algorithm, then optimizing just the top layer for supervised classification of the MNIST and fashion-MNIST data sets gives very good results. We also discuss mixing a prior guess for supervised weights together with an unsupervised representation of the data, yielding a smaller number of features nevertheless able to give good performance.
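
To make the linear scaling in the training set size concrete, here is a minimal sketch of one ingredient a coarse-graining scheme of this kind might use: accumulating a reduced covariance matrix for a pair of inputs in a single pass over the data. Everything named here is an assumption for illustration and is not taken from the abstract: the unit-norm local feature map `local_feature`, the helper names, and the toy `samples`. In a full tree tensor network one would presumably go on to diagonalize this matrix and keep its leading eigenvectors as the tensor that merges the two inputs, but that step is not shown.

```python
import math

def local_feature(x):
    """Map a scalar in [0, 1] to a unit-norm 2-vector (illustrative choice)."""
    return [math.cos(math.pi * x / 2), math.sin(math.pi * x / 2)]

def pair_feature(x1, x2):
    """Tensor (Kronecker) product of two local feature vectors, length 4."""
    a, b = local_feature(x1), local_feature(x2)
    return [ai * bj for ai in a for bj in b]

def reduced_covariance(samples, i, j):
    """Sum over samples of the outer product of the pair feature for inputs i, j.

    Each sample is visited once, so the cost is linear in the number of samples.
    """
    rho = [[0.0] * 4 for _ in range(4)]
    for x in samples:
        v = pair_feature(x[i], x[j])
        for r in range(4):
            for c in range(4):
                rho[r][c] += v[r] * v[c]
    return rho

# Hypothetical toy data: three samples with four inputs each, scaled to [0, 1].
samples = [
    [0.0, 0.1, 0.9, 1.0],
    [0.2, 0.0, 1.0, 0.8],
    [0.1, 0.1, 0.7, 0.9],
]
rho01 = reduced_covariance(samples, 0, 1)
```

Because the assumed feature map has unit norm, each sample contributes exactly 1 to the trace of `rho01`, so the trace equals the number of samples. Repeating the same accumulation for every adjacent pair of inputs costs a fixed amount per pair, which is consistent with the linear scaling in input dimension that the abstract claims.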
