A Study of Unsupervised Evaluation Metrics for Practical and Automatic Domain Adaptation
Minghao Chen, Zepeng Gao, Shuai Zhao, Qibo Qiu, Wenxiao Wang, Binbin, Lin, Xiaofei He

TL;DR
This paper introduces the Augmentation Consistency Metric (ACM), an unsupervised evaluation metric for domain adaptation that assesses model quality without target labels, improving hyper-parameter tuning and model selection.
Contribution
The paper proposes a novel unsupervised evaluation metric, ACM, that addresses key issues in existing metrics and enables automatic hyper-parameter optimization in domain adaptation.
Findings
ACM outperforms previous metrics in detecting negative transfer.
Using ACM for hyper-parameter search yields better results than manual tuning.
Large-scale experiments validate the effectiveness of the proposed metric.
Abstract
Unsupervised domain adaptation (UDA) methods facilitate the transfer of models to target domains without labels. However, these methods necessitate a labeled target validation set for hyper-parameter tuning and model selection. In this paper, we aim to find an evaluation metric capable of assessing the quality of a transferred model without access to target validation labels. We begin with the metric based on mutual information of the model prediction. Through empirical analysis, we identify three prevalent issues with this metric: 1) It does not account for the source structure. 2) It can be easily attacked. 3) It fails to detect negative transfer caused by the over-alignment of source and target features. To address the first two issues, we incorporate source accuracy into the metric and employ a new MLP classifier that is held out during training, significantly improving the result.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
