LSUN: Construction of a Large-scale Image Dataset using Deep Learning   with Humans in the Loop

Fisher Yu; Ari Seff; Yinda Zhang; Shuran Song; Thomas Funkhouser,; Jianxiong Xiao

arXiv:1506.03365·cs.CV·June 7, 2016·1.4k cites

LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop

Fisher Yu, Ari Seff, Yinda Zhang, Shuran Song, Thomas Funkhouser,, Jianxiong Xiao

PDF

Open Access 4 Repos 10 Models

TL;DR

This paper introduces LSUN, a large-scale image dataset created through a semi-automated, human-in-the-loop labeling process, significantly enhancing training data for visual recognition models.

Contribution

The paper presents a novel semi-automated labeling scheme combining deep learning and human effort to efficiently construct a large-scale, diverse image dataset called LSUN.

Findings

01

Training convolutional networks on LSUN improves accuracy

02

The dataset contains around one million images per category

03

The labeling process is effective and scalable

Abstract

While there has been remarkable progress in the performance of visual recognition algorithms, the state-of-the-art models tend to be exceptionally data-hungry. Large labeled training datasets, expensive and tedious to produce, are required to optimize millions of parameters in deep network models. Lagging behind the growth in model capacity, the available datasets are quickly becoming outdated in terms of size and density. To circumvent this bottleneck, we propose to amplify human effort through a partially automated labeling scheme, leveraging deep learning with humans in the loop. Starting from a large set of candidate images for each category, we iteratively sample a subset, ask people to label them, classify the others with a trained model, split the set into positives, negatives, and unlabeled based on the classification confidence, and then iterate with the unlabeled set. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications