Unsupervised Feature Learning with Emergent Data-Driven Prototypicality

Yunhui Guo; Youren Zhang; Yubei Chen; Stella X. Yu

arXiv:2307.01421·cs.CV·July 6, 2023·1 cites

Unsupervised Feature Learning with Emergent Data-Driven Prototypicality

Yunhui Guo, Youren Zhang, Yubei Chen, Stella X. Yu

PDF

Open Access 3 Reviews

TL;DR

This paper introduces an unsupervised method for feature learning in hyperbolic space that encodes image prototypicality through spatial positioning, improving instance selection and model robustness.

Contribution

It proposes a novel hyperbolic space-based unsupervised feature learning algorithm using sphere packing, capturing prototypicality directly from data without labels.

Findings

01

Images closer to the origin are more prototypical.

02

Hyperbolic space captures prototypicality more effectively than Euclidean.

03

Method improves sample efficiency and model robustness.

Abstract

Given an image set without any labels, our goal is to train a model that maps each image to a point in a feature space such that, not only proximity indicates visual similarity, but where it is located directly encodes how prototypical the image is according to the dataset. Our key insight is to perform unsupervised feature learning in hyperbolic instead of Euclidean space, where the distance between points still reflect image similarity, and yet we gain additional capacity for representing prototypicality with the location of the point: The closer it is to the origin, the more prototypical it is. The latter property is simply emergent from optimizing the usual metric learning objective: The image similar to many training instances is best placed at the center of corresponding points in Euclidean space, but closer to the origin in hyperbolic space. We propose an unsupervised feature…

Peer Reviews

Decision·ICLR 2024 Conference Withdrawn Submission

Reviewer 01Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

In this work, the authors propose the unsupervised feature learning method from a novel perspective that aims to capture both visual similarity and prototypicality.

Weaknesses

1. The motivation of the proposed method is not clear. It lacks clarification of motivation to state that: what are the shortcomings of existing methods that do not consider prototypicality? Why does the unsupervised feature learning method need to consider prototypicality? The motivation mentioned in the first paragraph of Section 1 is too vague. 2. In the paper, the work has limited motivation, which seems to be a combination of existing technologies with introducing existing concepts. 3. It i

Reviewer 02Rating 5· marginally below the acceptance thresholdConfidence 2

Strengths

Strength: 1. Paper is well organized. 2. The use of hyperbolic space instead of Euclidean space is well-motivated.

Weaknesses

Weakness: 1. CIFAR and MNIST are too toy. ImageNet experiment and fair comparison with previous unsupervised learning (especially contrastive learning) are important, but missing in this work. 2. LeNet is also too toy for a fair comparison with the latest results on unsupervised learning. A model of the ResNet level is a must. 3. Some related works on prototype learning are not cited, like “Prototypical Contrastive Learning of Unsupervised Representations”.

Reviewer 03Rating 3· reject, not good enoughConfidence 4

Strengths

- The proposed unsupervised method HACK does have clear distinctions with existing methods: unlike supervised learning, HACK allows the image to be assigned to any target (particle). Unlike existing unsupervised learning method, HACK learns to match to a predefined geometrical organization in hyperbolic space (uniformly distributed). - The core instance assignment problem is cast as a bipartite matching problem and solved with the well-known Hungarian algorithm that has good convergence properti

Weaknesses

I think the presentation of this paper needs improvements. One main issue is that the authors keep talking about how HACK works and how it can encode both visual similarity and prototypicality, without enough explanations about the reason why. It's suggested to list the intuitions upfront, so readers won't always question why HACK is designed this way and why it works at all. Specifically, - Missing intuition everywhere about why images should be assigned to uniformly distributed particles. Only

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCell Image Analysis Techniques · Digital Imaging for Blood Diseases · Machine Learning and Data Classification