Towards Reliable Domain Generalization: A New Dataset and Evaluations

Jiao Zhang; Xu-Yao Zhang; Cheng-Lin Liu

arXiv:2309.06142·cs.CV·September 13, 2023

Towards Reliable Domain Generalization: A New Dataset and Evaluations

Jiao Zhang, Xu-Yao Zhang, Cheng-Lin Liu

PDF

Open Access

TL;DR

This paper introduces a new dataset and evaluation framework for domain generalization in handwritten Chinese character recognition, revealing limitations of current methods and advocating for dynamic evaluation protocols.

Contribution

It proposes a novel DG dataset for HCCR, evaluates existing methods, and highlights the need for dynamic evaluation to improve reliability.

Findings

01

Existing DG methods perform poorly on the new dataset.

02

The leave-one-domain-out protocol is unreliable for evaluation.

03

Dynamic performance evaluation offers more comprehensive insights.

Abstract

There are ubiquitous distribution shifts in the real world. However, deep neural networks (DNNs) are easily biased towards the training set, which causes severe performance degradation when they receive out-of-distribution data. Many methods are studied to train models that generalize under various distribution shifts in the literature of domain generalization (DG). However, the recent DomainBed and WILDS benchmarks challenged the effectiveness of these methods. Aiming at the problems in the existing research, we propose a new domain generalization task for handwritten Chinese character recognition (HCCR) to enrich the application scenarios of DG method research. We evaluate eighteen DG methods on the proposed PaHCC (Printed and Handwritten Chinese Characters) dataset and show that the performance of existing methods on this dataset is still unsatisfactory. Besides, under a designed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Text and Document Classification Technologies · Multimodal Machine Learning Applications