Deep Transformation-Invariant Clustering
Tom Monnier, Thibault Groueix, Mathieu Aubry

TL;DR
This paper introduces a novel deep clustering method that learns to predict image transformations directly in image space, enabling transformation invariance and interpretability without relying on abstract feature representations.
Contribution
It presents two new deep transformation-invariant clustering frameworks that jointly learn prototypes and transformations, enhancing interpretability and adaptability in image clustering.
Findings
Achieves competitive results on standard benchmarks.
Demonstrates robustness and interpretability in real photo collections.
Enables easy adaptation to different invariance types.
Abstract
Recent advances in image clustering typically focus on learning better deep representations. In contrast, we present an orthogonal approach that does not rely on abstract features but instead learns to predict image transformations and performs clustering directly in image space. This learning process naturally fits in the gradient-based training of K-means and Gaussian mixture model, without requiring any additional loss or hyper-parameters. It leads us to two new deep transformation-invariant clustering frameworks, which jointly learn prototypes and transformations. More specifically, we use deep learning modules that enable us to resolve invariance to spatial, color and morphological transformations. Our approach is conceptually simple and comes with several advantages, including the possibility to easily adapt the desired invariance to the task and a strong interpretability of both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Remote-Sensing Image Classification
MethodsInterpretability
