Supervised Convex Clustering
Minjie Wang, Tianyi Yao, Genevera I. Allen

TL;DR
This paper introduces Supervised Convex Clustering (SCC), a new method that combines auxiliary supervision with unlabeled data to produce more interpretable clusters, demonstrated through simulations and Alzheimer's Disease genomics case study.
Contribution
It proposes a novel supervised convex clustering method that integrates auxiliary variables and extends to biclustering, enhancing interpretability over traditional unsupervised clustering.
Findings
Identifies new candidate genes related to Alzheimer's Disease.
Discovers novel subtypes of Alzheimer's Disease.
Demonstrates improved interpretability and accuracy in clustering.
Abstract
Clustering has long been a popular unsupervised learning approach to identify groups of similar objects and discover patterns from unlabeled data in many applications. Yet, coming up with meaningful interpretations of the estimated clusters has often been challenging precisely due to its unsupervised nature. Meanwhile, in many real-world scenarios, there are some noisy supervising auxiliary variables, for instance, subjective diagnostic opinions, that are related to the observed heterogeneity of the unlabeled data. By leveraging information from both supervising auxiliary variables and unlabeled data, we seek to uncover more scientifically interpretable group structures that may be hidden by completely unsupervised analyses. In this work, we propose and develop a new statistical pattern discovery method named Supervised Convex Clustering (SCC) that borrows strength from both information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Advanced Clustering Algorithms Research · Bayesian Methods and Mixture Models
