Shapley-Inspired Feature Weighting in $k$-means with No Additional Hyperparameters
Richard J. Fawley, Renato Cordeiro de Amorim

TL;DR
SHARK introduces a parameter-free, Shapley value-based feature weighting method for $k$-means clustering, enhancing robustness and accuracy in noisy or high-dimensional data without additional hyperparameters.
Contribution
The paper presents SHARK, a novel $k$-means extension that uses Shapley values for feature relevance, with a polynomial-time computation and no extra parameters.
Findings
SHARK outperforms existing feature weighting methods in robustness and accuracy.
It effectively identifies relevant features in noisy, high-dimensional data.
The method requires no additional hyperparameter tuning.
Abstract
Clustering algorithms often assume all features contribute equally to the data structure, an assumption that usually fails in high-dimensional or noisy settings. Feature weighting methods can address this, but most require additional parameter tuning. We propose SHARK (Shapley Reweighted -means), a feature-weighted clustering algorithm motivated by the use of Shapley values from cooperative game theory to quantify feature relevance, which requires no additional parameters beyond those in -means. We prove that the -means objective can be decomposed into a sum of per-feature Shapley values, providing an axiomatic foundation for unsupervised feature relevance and reducing Shapley computation from exponential to polynomial time. SHARK iteratively re-weights features by the inverse of their Shapley contribution, emphasising informative dimensions and down-weighting irrelevant ones.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Machine Learning and Data Classification · Data Mining Algorithms and Applications
