Target-Embedding Autoencoders for Supervised Representation Learning

Daniel Jarrett; Mihaela van der Schaar

arXiv:2001.08345·stat.ML·January 24, 2020·5 cites

Target-Embedding Autoencoders for Supervised Representation Learning

Daniel Jarrett, Mihaela van der Schaar

PDF

Open Access

TL;DR

This paper introduces target-embedding autoencoders (TEA), a supervised learning framework that improves generalization by learning intermediate representations predictive of high-dimensional targets, with theoretical guarantees and empirical validation across various architectures.

Contribution

The paper formalizes TEA for supervised prediction, providing a generalization guarantee and demonstrating its effectiveness in sequence forecasting tasks beyond static classification.

Findings

01

TEA offers a regularization effect through auxiliary reconstruction.

02

TEA improves performance in multivariate sequence forecasting.

03

Theoretical stability guarantees support TEA's generalization ability.

Abstract

Autoencoder-based learning has emerged as a staple for disciplining representations in unsupervised and semi-supervised settings. This paper analyzes a framework for improving generalization in a purely supervised setting, where the target space is high-dimensional. We motivate and formalize the general framework of target-embedding autoencoders (TEA) for supervised prediction, learning intermediate latent representations jointly optimized to be both predictable from features as well as predictive of targets---encoding the prior that variations in targets are driven by a compact set of underlying factors. As our theoretical contribution, we provide a guarantee of generalization for linear TEAs by demonstrating uniform stability, interpreting the benefit of the auxiliary reconstruction task as a form of regularization. As our empirical contribution, we extend validation of this approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning