TL;DR
This paper introduces ActiSiamese, an online active learning algorithm with siamese networks and memory, designed to classify nonstationary, imbalanced data streams efficiently and effectively.
Contribution
It proposes a novel density-based active learning strategy in the latent space and demonstrates superior performance over existing methods in various challenging streaming scenarios.
Findings
ActiSiamese outperforms baseline algorithms in nonstationary, imbalanced data streams.
The method is effective even with limited labeled data.
Memory and ensembling improve classification accuracy.
Abstract
We have witnessed in recent years an ever-growing volume of information becoming available in a streaming manner in various application areas. As a result, there is an emerging need for online learning methods that train predictive models on-the-fly. A series of open challenges, however, hinder their deployment in practice. These are, learning as data arrive in real-time one-by-one, learning from data with limited ground truth information, learning from nonstationary data, and learning from severely imbalanced data, while occupying a limited amount of memory for data storage. We propose the ActiSiamese algorithm, which addresses these challenges by combining online active learning, siamese networks, and a multi-queue memory. It develops a new density-based active learning strategy which considers similarity in the latent (rather than the input) space. We conduct an extensive study that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
