Pool-Based Sequential Active Learning for Regression

Dongrui Wu

arXiv:1805.04735·cs.LG·March 31, 2020·1 cites

Pool-Based Sequential Active Learning for Regression

Dongrui Wu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new pool-based sequential active learning method for regression that emphasizes informativeness, representativeness, and diversity, and demonstrates its effectiveness through extensive experiments.

Contribution

It proposes a novel ALR approach combining passive sampling with existing methods, enhancing sample selection for regression tasks.

Findings

01

The new approach outperforms existing ALR methods on multiple datasets.

02

Incorporating representativeness and diversity improves sample efficiency.

03

The method is versatile and can be integrated with other ALR approaches.

Abstract

Active learning is a machine learning approach for reducing the data labeling effort. Given a pool of unlabeled samples, it tries to select the most useful ones to label so that a model built from them can achieve the best possible performance. This paper focuses on pool-based sequential active learning for regression (ALR). We first propose three essential criteria that an ALR approach should consider in selecting the most useful unlabeled samples: informativeness, representativeness, and diversity, and compare four existing ALR approaches against them. We then propose a new ALR approach using passive sampling, which considers both the representativeness and the diversity in both the initialization and subsequent iterations. Remarkably, this approach can also be integrated with other existing ALR approaches in the literature to further improve the performance. Extensive experiments on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

drwuHUST/sAL
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Imbalanced Data Classification Techniques