Progressive Cluster Purification for Unsupervised Feature Learning
Yifei Zhang, Chang Liu, Yu Zhou, Wei Wang, Weiping Wang, Qixiang Ye

TL;DR
This paper introduces Progressive Cluster Purification (PCP), a novel unsupervised feature learning method that iteratively refines clusters by removing noise, leading to improved discriminative representations.
Contribution
The paper proposes a new clustering-based approach that progressively refines clusters by excluding inconsistent samples, enhancing unsupervised feature learning performance.
Findings
Significant performance improvements over baseline methods.
Effective noise filtering in cluster formation.
Robustness across multiple benchmark datasets.
Abstract
In unsupervised feature learning, sample specificity based methods ignore the inter-class information, which deteriorates the discriminative capability of representation models. Clustering based methods are error-prone to explore the complete class boundary information due to the inevitable class inconsistent samples in each cluster. In this work, we propose a novel clustering based method, which, by iteratively excluding class inconsistent samples during progressive cluster formation, alleviates the impact of noise samples in a simple-yet-effective manner. Our approach, referred to as Progressive Cluster Purification (PCP), implements progressive clustering by gradually reducing the number of clusters during training, while the sizes of clusters continuously expand consistently with the growth of model representation capability. With a well-designed cluster purification mechanism, it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification
