Cluster-Adaptive Feature Extraction and its Theoretical Foundation with Minkowski Weighted k-Means
Renato Cordeiro de Amorim, Vladimir Makarenkov

TL;DR
This paper introduces a theoretical foundation for Minkowski weighted k-means, revealing how feature weights depend on dispersion, and proposes Cluster-Adaptive Feature Extraction (CAFE) to enhance unsupervised feature extraction.
Contribution
The paper provides a theoretical analysis of mwk-means, deriving bounds and structure of feature weights, and introduces CAFE, a novel method leveraging these insights for improved feature extraction.
Findings
Theoretical bounds for mwk-means objective function.
Feature weights depend on relative dispersion following a power-law.
CAFE improves traditional feature extraction methods in noisy, unsupervised settings.
Abstract
The Minkowski weighted -means (-means) algorithm extends classical -means by incorporating feature weights and a Minkowski distance. We first show that the -means objective can be expressed as a power-mean aggregation of within-cluster dispersions, with the order determined by the Minkowski exponent . This formulation reveals how controls the transition between selective and uniform use of features. Using this representation, we derive bounds for the objective function and characterise the structure of the feature weights, showing that they depend only on relative dispersion and follow a power-law relationship with dispersion ratios. This leads to explicit guarantees on the suppression of high-dispersion features, and we establish convergence of the algorithm. Building on these theoretical results, we introduce Cluster-Adaptive Feature Extraction (CAFE), a method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
