PROTOCOL: Partial Optimal Transport-enhanced Contrastive Learning for Imbalanced Multi-view Clustering
Xuqian Xue, Yiming Lei, Qi Cai, Hongming Shan, Junping Zhang

TL;DR
This paper introduces PROTOCOL, a novel contrastive learning framework that addresses class imbalance in multi-view clustering by leveraging partial optimal transport and class-rebalanced contrastive techniques, significantly improving performance.
Contribution
It pioneers the systematic study of imbalanced multi-view clustering and proposes a new framework combining partial optimal transport with contrastive learning to handle class imbalance.
Findings
Significantly improves clustering on imbalanced multi-view datasets.
Effectively perceives class imbalance via partial optimal transport.
Enhances minority sample representation through class-rebalanced contrastive learning.
Abstract
While contrastive multi-view clustering has achieved remarkable success, it implicitly assumes balanced class distribution. However, real-world multi-view data primarily exhibits class imbalance distribution. Consequently, existing methods suffer performance degradation due to their inability to perceive and model such imbalance. To address this challenge, we present the first systematic study of imbalanced multi-view clustering, focusing on two fundamental problems: i. perceiving class imbalance distribution, and ii. mitigating representation degradation of minority samples. We propose PROTOCOL, a novel PaRtial Optimal TranspOrt-enhanced COntrastive Learning framework for imbalanced multi-view clustering. First, for class imbalance perception, we map multi-view features into a consensus space and reformulate the imbalanced clustering as a partial optimal transport (POT) problem,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Text and Document Classification Technologies · Data-Driven Disease Surveillance
