Towards Reliable Domain Generalization: A New Dataset and Evaluations
Jiao Zhang, Xu-Yao Zhang, Cheng-Lin Liu

TL;DR
This paper introduces a new dataset and evaluation framework for domain generalization in handwritten Chinese character recognition, revealing limitations of current methods and advocating for dynamic evaluation protocols.
Contribution
It proposes a novel DG dataset for HCCR, evaluates existing methods, and highlights the need for dynamic evaluation to improve reliability.
Findings
Existing DG methods perform poorly on the new dataset.
The leave-one-domain-out protocol is unreliable for evaluation.
Dynamic performance evaluation offers more comprehensive insights.
Abstract
There are ubiquitous distribution shifts in the real world. However, deep neural networks (DNNs) are easily biased towards the training set, which causes severe performance degradation when they receive out-of-distribution data. Many methods are studied to train models that generalize under various distribution shifts in the literature of domain generalization (DG). However, the recent DomainBed and WILDS benchmarks challenged the effectiveness of these methods. Aiming at the problems in the existing research, we propose a new domain generalization task for handwritten Chinese character recognition (HCCR) to enrich the application scenarios of DG method research. We evaluate eighteen DG methods on the proposed PaHCC (Printed and Handwritten Chinese Characters) dataset and show that the performance of existing methods on this dataset is still unsatisfactory. Besides, under a designed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Text and Document Classification Technologies · Multimodal Machine Learning Applications
