Discovery of Visual Semantics by Unsupervised and Self-Supervised Representation Learning
Gustav Larsson

TL;DR
This paper introduces a self-supervised image colorization method that enhances representation learning and visual task performance without labeled data, outperforming traditional unsupervised techniques and revitalizing old photographs.
Contribution
Develops a fully automatic image colorization approach for self-supervised learning, setting new state-of-the-art in colorizing black-and-white images and improving visual task performance.
Findings
Self-supervised colorization outperforms traditional unsupervised methods.
Colorization enables learning representations useful for classification and segmentation.
The approach revitalizes old black-and-white photographs without human effort.
Abstract
The success of deep learning in computer vision is rooted in the ability of deep networks to scale up model complexity as demanded by challenging visual tasks. As complexity is increased, so is the need for large amounts of labeled data to train the model. This is associated with a costly human annotation effort. To address this concern, with the long-term goal of leveraging the abundance of cheap unlabeled data, we explore methods of unsupervised "pre-training." In particular, we propose to use self-supervised automatic image colorization. We show that traditional methods for unsupervised learning, such as layer-wise clustering or autoencoders, remain inferior to supervised pre-training. In search for an alternative, we develop a fully automatic image colorization method. Our method sets a new state-of-the-art in revitalizing old black-and-white photography, without requiring human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis
