Improving Baselines in the Wild

Kazuki Irie; Imanol Schlag; Róbert Csordás; Jürgen Schmidhuber

arXiv:2112.15550·cs.LG·January 3, 2022

TL;DR

This paper reports practical observations from training baselines on two WILDS datasets, iWildCam and FMoW. Model selection should be done separately for each evaluation metric, small changes to training hyper-parameters can improve the baseline by a relatively large margin, and certain domains correlate strongly with certain target labels.

Contribution

The study offers empirical observations on iWildCam and FMoW covering metric-specific validation, hyper-parameter sensitivity, weak validation-test correlation, and domain-label correlations, which the authors believe are of general interest for future work on WILDS.
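One simple way to look for a domain-label correlation is to measure, for each domain, how much of its data carries that domain's most common label. The sketch below is only an illustration of that idea and is not the authors' analysis; the function name `majority_label_share`, the camera IDs, and the labels are all hypothetical toy values.

```python
from collections import Counter, defaultdict
from typing import Dict, Hashable, Iterable, Tuple


def majority_label_share(
    pairs: Iterable[Tuple[Hashable, Hashable]],
) -> Dict[Hashable, float]:
    """For each domain, return the fraction of its examples that carry
    the domain's most common label (1.0 means the domain has one label)."""
    counts = defaultdict(Counter)
    for domain, label in pairs:
        counts[domain][label] += 1
    return {d: max(c.values()) / sum(c.values()) for d, c in counts.items()}


# Toy (domain, label) pairs; hypothetical values, not taken from WILDS.
pairs = [
    ("cam_a", "empty"), ("cam_a", "empty"), ("cam_a", "empty"), ("cam_a", "deer"),
    ("cam_b", "deer"), ("cam_b", "fox"),
]
shares = majority_label_share(pairs)
print(shares)
```

A share close to 1.0 for many domains would suggest that the domain identity alone goes a long way toward predicting the label.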

Findings

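01. Separate cross-validation for each evaluation metric is crucial on both iWildCam and FMoW.

02. A weak correlation between validation and test performance may complicate model development on iWildCam.

03. Minor changes to training hyper-parameters improve the baseline by a relatively large margin, mainly on FMoW.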
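The first finding can be pictured as picking a separate best checkpoint for every metric instead of one checkpoint for all of them. This is a minimal sketch under stated assumptions: the helper `select_best_per_metric`, the metric names, and the scores are hypothetical, and higher is assumed to be better for every metric.

```python
from typing import Dict, List


def select_best_per_metric(val_history: List[Dict[str, float]]) -> Dict[str, int]:
    """Return, for each metric, the index of the checkpoint with the
    highest validation score (ties go to the earliest checkpoint)."""
    if not val_history:
        raise ValueError("val_history must contain at least one checkpoint")
    metrics = val_history[0].keys()
    return {
        m: max(range(len(val_history)), key=lambda i: val_history[i][m])
        for m in metrics
    }


# Hypothetical validation scores for three checkpoints.
val_history = [
    {"accuracy": 0.70, "macro_f1": 0.30},
    {"accuracy": 0.74, "macro_f1": 0.28},
    {"accuracy": 0.72, "macro_f1": 0.35},
]
best = select_best_per_metric(val_history)
print(best)
```

In this toy history the checkpoint that maximizes accuracy differs from the one that maximizes macro F1, which is the situation where a single shared selection would penalize one of the two metrics.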

Abstract

We share our experience with the recently released WILDS benchmark, a collection of ten datasets dedicated to developing models and training strategies which are robust to domain shifts. Several experiments yield a couple of critical observations which we believe are of general interest for any future work on WILDS. Our study focuses on two datasets: iWildCam and FMoW. We show that (1) Conducting separate cross-validation for each evaluation metric is crucial for both datasets, (2) A weak correlation between validation and test performance might make model development difficult for iWildCam, (3) Minor changes in the training of hyper-parameters improve the baseline by a relatively large margin (mainly on FMoW), (4) There is a strong correlation between certain domains and certain target labels (mainly on iWildCam). To the best of our knowledge, no prior work on these datasets has…
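Observation (2) concerns how well validation scores track test scores. One way to quantify this across a set of runs is the Pearson correlation coefficient, sketched below in plain Python. The scores are made-up placeholders rather than results from the paper, and the choice of Pearson correlation is an assumption made for illustration.

```python
import math
from typing import Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Placeholder per-run scores; hypothetical values, not from the paper.
val_scores = [0.30, 0.32, 0.35, 0.36]
test_scores = [0.28, 0.33, 0.29, 0.31]
r = pearson(val_scores, test_scores)
print(f"validation-test correlation: {r:.3f}")
```

A coefficient near zero would mean that ranking runs by validation score says little about their ranking on the test set.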

Code & Models

Repositories

kazuki-irie/fork--wilds-public (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification