ALiPy: Active Learning in Python
Ying-Peng Tang, Guo-Xiang Li, Sheng-Jun Huang

TL;DR
ALiPy is an open-source Python toolbox that facilitates the implementation, evaluation, and comparison of various active learning algorithms, supporting diverse settings like multi-label data and noisy annotators.
Contribution
It introduces a comprehensive, modular framework for active learning in Python, including over 20 algorithms and customization options for different scenarios.
Findings
Supports evaluation and comparison of active learning methods
Includes implementations of 20+ state-of-the-art algorithms
Enables customization for various active learning settings
Abstract
Supervised machine learning methods usually require a large set of labeled examples for model training. However, in many real applications, there are plentiful unlabeled data but limited labeled data; and the acquisition of labels is costly. Active learning (AL) reduces the labeling cost by iteratively selecting the most valuable data to query their labels from the annotator. This article introduces a Python toobox ALiPy for active learning. ALiPy provides a module based implementation of active learning framework, which allows users to conveniently evaluate, compare and analyze the performance of active learning methods. In the toolbox, multiple options are available for each component of the learning framework, including data process, active selection, label query, results visualization, etc. In addition to the implementations of more than 20 state-of-the-art active learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Algorithms and Data Compression
