LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop
Fisher Yu, Ari Seff, Yinda Zhang, Shuran Song, Thomas Funkhouser,, Jianxiong Xiao

TL;DR
This paper introduces LSUN, a large-scale image dataset created through a semi-automated, human-in-the-loop labeling process, significantly enhancing training data for visual recognition models.
Contribution
The paper presents a novel semi-automated labeling scheme combining deep learning and human effort to efficiently construct a large-scale, diverse image dataset called LSUN.
Findings
Training convolutional networks on LSUN improves accuracy
The dataset contains around one million images per category
The labeling process is effective and scalable
Abstract
While there has been remarkable progress in the performance of visual recognition algorithms, the state-of-the-art models tend to be exceptionally data-hungry. Large labeled training datasets, expensive and tedious to produce, are required to optimize millions of parameters in deep network models. Lagging behind the growth in model capacity, the available datasets are quickly becoming outdated in terms of size and density. To circumvent this bottleneck, we propose to amplify human effort through a partially automated labeling scheme, leveraging deep learning with humans in the loop. Starting from a large set of candidate images for each category, we iteratively sample a subset, ask people to label them, classify the others with a trained model, split the set into positives, negatives, and unlabeled based on the classification confidence, and then iterate with the unlabeled set. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗dg845/consistency-model-pipelinesmodel· 28 dl· ♡ 128 dl♡ 1
- 🤗dg845/diffusers-cd_bedroom256_l2model· 5 dl5 dl
- 🤗dg845/diffusers-cd_cat256_l2model· 2 dl2 dl
- 🤗dg845/diffusers-cd_imagenet64_lpipsmodel· 6 dl6 dl
- 🤗dg845/diffusers-ct_imagenet64model· 1 dl1 dl
- 🤗dg845/diffusers-cd_bedroom256_lpipsmodel· 1 dl1 dl
- 🤗dg845/diffusers-ct_bedroom256model· 3 dl3 dl
- 🤗dg845/diffusers-cd_cat256_lpipsmodel· 5 dl5 dl
- 🤗dg845/diffusers-ct_cat256model· 5 dl5 dl
- 🤗openai/diffusers-ct_cat256model· 5 dl· ♡ 35 dl♡ 3
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
