Towards Principled Unsupervised Learning
Ilya Sutskever, Rafal Jozefowicz, Karol Gregor, Danilo Rezende, Tim, Lillicrap, Oriol Vinyals

TL;DR
This paper introduces the Output Distribution Matching (ODM) cost, a new unsupervised learning objective that aligns prediction distributions with label distributions, improving performance and enabling domain adaptation without labeled data.
Contribution
The paper proposes the ODM cost as a principled unsupervised objective that correlates with supervised performance and demonstrates its effectiveness on small datasets and domain adaptation tasks.
Findings
ODM cost aligns prediction and label distributions effectively.
Optimizing ODM improves supervised classification performance.
ODM enables one-shot domain adaptation without labeled data.
Abstract
General unsupervised learning is a long-standing conceptual problem in machine learning. Supervised learning is successful because it can be solved by the minimization of the training error cost function. Unsupervised learning is not as successful, because the unsupervised objective may be unrelated to the supervised task of interest. For an example, density modelling and reconstruction have often been used for unsupervised learning, but they did not produced the sought-after performance gains, because they have no knowledge of the supervised tasks. In this paper, we present an unsupervised cost function which we name the Output Distribution Matching (ODM) cost, which measures a divergence between the distribution of predictions and distributions of labels. The ODM cost is appealing because it is consistent with the supervised cost in the following sense: a perfect supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Generative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare
