K-means Clustering Based Feature Consistency Alignment for Label-free Model Evaluation
Shuyu Miao, Lin Zheng, Jingjing Liu, and Hong Jin

TL;DR
This paper introduces a novel label-free model evaluation method using K-means clustering to align feature distributions and predict model performance without ground truth labels, demonstrating significant improvements in accuracy and robustness.
Contribution
We propose K-means Clustering Based Feature Consistency Alignment (KCFCA), a new approach for label-free model evaluation that handles distribution shifts and enhances performance prediction.
Findings
Secured 2nd place on DataCV Challenge leaderboard.
Achieved 36% improvement over baseline RMSE.
Produced more robust and optimal single model performance.
Abstract
The label-free model evaluation aims to predict the model performance on various test sets without relying on ground truths. The main challenge of this task is the absence of labels in the test data, unlike in classical supervised model evaluation. This paper presents our solutions for the 1st DataCV Challenge of the Visual Dataset Understanding workshop at CVPR 2023. Firstly, we propose a novel method called K-means Clustering Based Feature Consistency Alignment (KCFCA), which is tailored to handle the distribution shifts of various datasets. KCFCA utilizes the K-means algorithm to cluster labeled training sets and unlabeled test sets, and then aligns the cluster centers with feature consistency. Secondly, we develop a dynamic regression model to capture the relationship between the shifts in distribution and model accuracy. Thirdly, we design an algorithm to discover the outlier model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
MethodsTest · k-Means Clustering
