Online Unsupervised Learning of Visual Representations and Categories

Mengye Ren; Tyler R. Scott; Michael L. Iuzzolino; Michael C. Mozer,; Richard Zemel

arXiv:2109.05675·cs.CV·May 31, 2022

Online Unsupervised Learning of Visual Representations and Categories

Mengye Ren, Tyler R. Scott, Michael L. Iuzzolino, Michael C. Mozer,, Richard Zemel

PDF

Open Access 1 Repo

TL;DR

This paper introduces an unsupervised online learning model that learns visual representations and categories from raw, nonstationary data streams without labels, outperforming existing self-supervised methods in category recognition.

Contribution

It proposes a prototype-based memory network with an online mixture model that forms new categories from single examples, handling imbalanced data and nonstationary environments.

Findings

01

Learned representations outperform state-of-the-art self-supervised methods in category recognition.

02

Model effectively forms new categories from single examples in an online setting.

03

Handles nonstationary data streams with imbalanced class distributions.

Abstract

Real world learning scenarios involve a nonstationary distribution of classes with sequential dependencies among the samples, in contrast to the standard machine learning formulation of drawing samples independently from a fixed, typically uniform distribution. Furthermore, real world interactions demand learning on-the-fly from few or no class labels. In this work, we propose an unsupervised model that simultaneously performs online visual representation learning and few-shot learning of new categories without relying on any class labels. Our model is a prototype-based memory network with a control component that determines when to form a new class prototype. We formulate it as an online mixture model, where components are created with only a single new example, and assignments do not have to be balanced, which permits an approximation to natural imbalanced distributions from uncurated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

renmengye/online-unsup-proto-net
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques

MethodsMemory Network