On Deep Unsupervised Active Learning

Changsheng Li; Handong Ma; Zhao Kang; Ye Yuan; Xiao-Yu; Zhang; Guoren Wang

arXiv:2007.13959·cs.LG·July 29, 2020

On Deep Unsupervised Active Learning

Changsheng Li, Handong Ma, Zhao Kang, Ye Yuan, Xiao-Yu, Zhang, Guoren Wang

PDF

Open Access

TL;DR

This paper introduces DUAL, a deep neural network framework for unsupervised active learning that effectively models data nonlinearity and selects representative samples by preserving data structure in a learned latent space.

Contribution

The paper proposes a novel deep learning approach for unsupervised active learning that explicitly captures nonlinearity and data structure for better sample selection.

Findings

01

DUAL outperforms existing methods on six datasets.

02

The framework effectively preserves cluster structures.

03

Nonlinear embedding improves sample representativeness.

Abstract

Unsupervised active learning has attracted increasing attention in recent years, where its goal is to select representative samples in an unsupervised setting for human annotating. Most existing works are based on shallow linear models by assuming that each sample can be well approximated by the span (i.e., the set of all linear combinations) of certain selected samples, and then take these selected samples as representative ones to label. However, in practice, the data do not necessarily conform to linear models, and how to model nonlinearity of data often becomes the key point to success. In this paper, we present a novel Deep neural network framework for Unsupervised Active Learning, called DUAL. DUAL can explicitly learn a nonlinear embedding to map each input into a latent space through an encoder-decoder architecture, and introduce a selection block to select representative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications