Unsupervised Feature Learning with Emergent Data-Driven Prototypicality
Yunhui Guo, Youren Zhang, Yubei Chen, Stella X. Yu

TL;DR
This paper introduces an unsupervised method for feature learning in hyperbolic space that encodes image prototypicality through spatial positioning, improving instance selection and model robustness.
Contribution
It proposes a novel hyperbolic space-based unsupervised feature learning algorithm using sphere packing, capturing prototypicality directly from data without labels.
Findings
Images closer to the origin are more prototypical.
Hyperbolic space captures prototypicality more effectively than Euclidean.
Method improves sample efficiency and model robustness.
Abstract
Given an image set without any labels, our goal is to train a model that maps each image to a point in a feature space such that, not only proximity indicates visual similarity, but where it is located directly encodes how prototypical the image is according to the dataset. Our key insight is to perform unsupervised feature learning in hyperbolic instead of Euclidean space, where the distance between points still reflect image similarity, and yet we gain additional capacity for representing prototypicality with the location of the point: The closer it is to the origin, the more prototypical it is. The latter property is simply emergent from optimizing the usual metric learning objective: The image similar to many training instances is best placed at the center of corresponding points in Euclidean space, but closer to the origin in hyperbolic space. We propose an unsupervised feature…
Peer Reviews
Decision·ICLR 2024 Conference Withdrawn Submission
In this work, the authors propose the unsupervised feature learning method from a novel perspective that aims to capture both visual similarity and prototypicality.
1. The motivation of the proposed method is not clear. It lacks clarification of motivation to state that: what are the shortcomings of existing methods that do not consider prototypicality? Why does the unsupervised feature learning method need to consider prototypicality? The motivation mentioned in the first paragraph of Section 1 is too vague. 2. In the paper, the work has limited motivation, which seems to be a combination of existing technologies with introducing existing concepts. 3. It i
Strength: 1. Paper is well organized. 2. The use of hyperbolic space instead of Euclidean space is well-motivated.
Weakness: 1. CIFAR and MNIST are too toy. ImageNet experiment and fair comparison with previous unsupervised learning (especially contrastive learning) are important, but missing in this work. 2. LeNet is also too toy for a fair comparison with the latest results on unsupervised learning. A model of the ResNet level is a must. 3. Some related works on prototype learning are not cited, like “Prototypical Contrastive Learning of Unsupervised Representations”.
- The proposed unsupervised method HACK does have clear distinctions with existing methods: unlike supervised learning, HACK allows the image to be assigned to any target (particle). Unlike existing unsupervised learning method, HACK learns to match to a predefined geometrical organization in hyperbolic space (uniformly distributed). - The core instance assignment problem is cast as a bipartite matching problem and solved with the well-known Hungarian algorithm that has good convergence properti
I think the presentation of this paper needs improvements. One main issue is that the authors keep talking about how HACK works and how it can encode both visual similarity and prototypicality, without enough explanations about the reason why. It's suggested to list the intuitions upfront, so readers won't always question why HACK is designed this way and why it works at all. Specifically, - Missing intuition everywhere about why images should be assigned to uniformly distributed particles. Only
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Digital Imaging for Blood Diseases · Machine Learning and Data Classification
