Distributionally Robust Coreset Selection under Covariate Shift

Tomonari Tanaka; Hiroyuki Hanada; Hanting Yang; Tatsuya Aoyama; Yu; Inatsu; Satoshi Akahane; Yoshito Okura; Noriaki Hashimoto; Taro Murayama,; Hanju Lee; Shinya Kojima; Ichiro Takeuchi

arXiv:2501.14253·stat.ML·February 19, 2025

Distributionally Robust Coreset Selection under Covariate Shift

Tomonari Tanaka, Hiroyuki Hanada, Hanting Yang, Tatsuya Aoyama, Yu, Inatsu, Satoshi Akahane, Yoshito Okura, Noriaki Hashimoto, Taro Murayama,, Hanju Lee, Shinya Kojima, Ichiro Takeuchi

PDF

Open Access

TL;DR

This paper introduces Distributionally Robust Coreset Selection (DRCS), a method for selecting training data that remains effective under unknown distribution shifts, especially covariate shift, improving robustness in deployment.

Contribution

The paper proposes DRCS, a novel coreset selection method that minimizes an upper bound on worst-case test error under distributional shifts, applicable to both convex and deep learning models.

Findings

01

DRCS effectively reduces worst-case test error under covariate shift.

02

Experimental results show improved robustness compared to traditional coreset methods.

03

The approach is adaptable to deep learning with suitable approximations.

Abstract

Coreset selection, which involves selecting a small subset from an existing training dataset, is an approach to reducing training data, and various approaches have been proposed for this method. In practical situations where these methods are employed, it is often the case that the data distributions differ between the development phase and the deployment phase, with the latter being unknown. Thus, it is challenging to select an effective subset of training data that performs well across all deployment scenarios. We therefore propose Distributionally Robust Coreset Selection (DRCS). DRCS theoretically derives an estimate of the upper bound for the worst-case test error, assuming that the future covariate distribution may deviate within a defined range from the training distribution. Furthermore, by selecting instances in a way that suppresses the estimate of the upper bound for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition

MethodsFocus