Data Distillation: Towards Omni-Supervised Learning

Ilija Radosavovic; Piotr Doll\'ar; Ross Girshick; Georgia Gkioxari,; Kaiming He

arXiv:1712.04440·cs.CV·December 13, 2017·43 cites

Data Distillation: Towards Omni-Supervised Learning

Ilija Radosavovic, Piotr Doll\'ar, Ross Girshick, Georgia Gkioxari,, Kaiming He

PDF

Open Access 4 Repos

TL;DR

This paper introduces data distillation, a method for omni-supervised learning that leverages all available labeled data and unlabeled internet-scale data to improve visual recognition models beyond traditional supervised methods.

Contribution

The paper proposes data distillation, a novel ensemble-based approach for utilizing unlabeled data in omni-supervised learning, demonstrating improved performance in visual recognition tasks.

Findings

01

Data distillation surpasses COCO supervised performance in human keypoint detection.

02

It also improves object detection accuracy beyond labeled-only training.

03

The method effectively leverages unlabeled data for state-of-the-art results.

Abstract

We investigate omni-supervised learning, a special regime of semi-supervised learning in which the learner exploits all available labeled data plus internet-scale sources of unlabeled data. Omni-supervised learning is lower-bounded by performance on existing labeled datasets, offering the potential to surpass state-of-the-art fully supervised methods. To exploit the omni-supervised setting, we propose data distillation, a method that ensembles predictions from multiple transformations of unlabeled data, using a single model, to automatically generate new training annotations. We argue that visual recognition models have recently become accurate enough that it is now possible to apply classic ideas about self-training to challenging real-world data. Our experimental results show that in the cases of human keypoint detection and general object detection, state-of-the-art models trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition

MethodsMask R-CNN · RoIAlign · 1x1 Convolution · Feature Pyramid Network · Region Proposal Network · Softmax · Convolution · RoIPool · Faster R-CNN