Model Assessment and Selection under Temporal Distribution Shift

Elise Han; Chengpiao Huang; Kaizheng Wang

arXiv:2402.08672 · cs.LG · June 5, 2024 · 1 citation

TL;DR

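This paper proposes an adaptive method for model assessment and selection under temporal distribution shift. It combines data from the current time period with data from historical epochs, estimates generalization error with an adaptive rolling window, and selects among candidate models through a single-elimination tournament of pairwise comparisons. Theoretical analysis and numerical experiments support the method.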

Contribution

The paper introduces an adaptive rolling window approach for estimating generalization error and a tournament-based model selection method, both designed for non-stationary environments.

Findings

01. The method effectively estimates generalization error under distribution shift.

02. The tournament approach achieves near-optimal model selection.

03. The approach is supported by theoretical analysis and numerical experiments.

Abstract

We investigate model assessment and selection in a changing environment, by synthesizing datasets from both the current time period and historical epochs. To tackle unknown and potentially arbitrary temporal distribution shift, we develop an adaptive rolling window approach to estimate the generalization error of a given model. This strategy also facilitates the comparison between any two candidate models by estimating the difference of their generalization errors. We further integrate pairwise comparisons into a single-elimination tournament, achieving near-optimal model selection from a collection of candidates. Theoretical analyses and numerical experiments demonstrate the adaptivity of our proposed methods to the non-stationarity in data.
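
The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation (see the official repository for that): the window-selection rule below is a generic Lepski-style heuristic, the Hoeffding-type width, the `delta` and `scale` parameters, the left-to-right pairing order, and all function names are assumptions made for illustration. Losses are assumed to lie in [0, 1] and to be paired across models within each period.

```python
import math
from statistics import fmean


def adaptive_window_estimate(losses_by_period, delta=0.1, scale=1.0):
    """Estimate the current mean loss from the k most recent periods,
    choosing k adaptively.

    losses_by_period: list of per-period loss lists, oldest first.
    scale: assumed width of the range of the values (1.0 for losses in
           [0, 1], 2.0 for differences of two such losses).
    Returns (estimate, chosen_window).
    """
    T = len(losses_by_period)
    means, widths = [], []
    for k in range(1, T + 1):
        pooled = [x for period in losses_by_period[-k:] for x in period]
        means.append(fmean(pooled))
        # Hoeffding-type noise level with a union bound over T windows
        # (an assumed stand-in for the variance proxy).
        widths.append(scale * math.sqrt(math.log(2 * T / delta) / (2 * len(pooled))))

    best_k, best_score = 1, float("inf")
    for k in range(1, T + 1):
        # Bias proxy: largest gap to a shorter window that noise cannot explain.
        bias = max(
            max(0.0, abs(means[k - 1] - means[i - 1]) - widths[k - 1] - widths[i - 1])
            for i in range(1, k + 1)
        )
        score = bias + widths[k - 1]
        if score < best_score:
            best_k, best_score = k, score
    return means[best_k - 1], best_k


def select_model(losses, delta=0.1):
    """Single-elimination tournament over candidate models.

    losses: dict mapping model name -> list of per-period loss lists.
    Each match estimates the difference of the two models' errors with the
    adaptive window and advances the model with the smaller estimated error.
    """
    def winner(a, b):
        diffs = [
            [x - y for x, y in zip(pa, pb)]
            for pa, pb in zip(losses[a], losses[b])
        ]
        est, _ = adaptive_window_estimate(diffs, delta=delta, scale=2.0)
        return a if est <= 0 else b

    pool = list(losses)
    while len(pool) > 1:
        next_round = [winner(pool[i], pool[i + 1]) for i in range(0, len(pool) - 1, 2)]
        if len(pool) % 2 == 1:
            next_round.append(pool[-1])  # odd one out gets a bye
        pool = next_round
    return pool[0]
```

Calling `adaptive_window_estimate(losses_by_period)` returns both the estimate and the chosen window length, so the selected window can be inspected directly. When the loss distribution changed a few periods ago, the bias proxy penalizes windows that reach back past the change, while the width term penalizes windows that are too short to average out noise.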

Code & Models

Repositories

eliselyhan/arw (Official)

Taxonomy

Topics: Simulation Techniques and Applications