Online Unsupervised Learning of Visual Representations and Categories
Mengye Ren, Tyler R. Scott, Michael L. Iuzzolino, Michael C. Mozer,, Richard Zemel

TL;DR
This paper introduces an unsupervised online learning model that learns visual representations and categories from raw, nonstationary data streams without labels, outperforming existing self-supervised methods in category recognition.
Contribution
It proposes a prototype-based memory network with an online mixture model that forms new categories from single examples, handling imbalanced data and nonstationary environments.
Findings
Learned representations outperform state-of-the-art self-supervised methods in category recognition.
Model effectively forms new categories from single examples in an online setting.
Handles nonstationary data streams with imbalanced class distributions.
Abstract
Real world learning scenarios involve a nonstationary distribution of classes with sequential dependencies among the samples, in contrast to the standard machine learning formulation of drawing samples independently from a fixed, typically uniform distribution. Furthermore, real world interactions demand learning on-the-fly from few or no class labels. In this work, we propose an unsupervised model that simultaneously performs online visual representation learning and few-shot learning of new categories without relying on any class labels. Our model is a prototype-based memory network with a control component that determines when to form a new class prototype. We formulate it as an online mixture model, where components are created with only a single new example, and assignments do not have to be balanced, which permits an approximation to natural imbalanced distributions from uncurated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
MethodsMemory Network
