Learning to Select Pivotal Samples for Meta Re-weighting

Yinjun Wu; Adam Stein; Jacob Gardner; Mayur Naik

arXiv:2302.04418·cs.LG·February 10, 2023

Learning to Select Pivotal Samples for Meta Re-weighting

Yinjun Wu, Adam Stein, Jacob Gardner, Mayur Naik

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a learning framework to identify optimal meta samples from large, imperfect datasets for improved re-weighting in machine learning, using clustering methods to enhance performance.

Contribution

It proposes a novel approach that learns to select meta samples via clustering, addressing the challenge of imperfect data in meta re-weighting.

Findings

01

RBC and GBC outperform baseline methods in experiments.

02

Theoretical analysis supports the clustering approach.

03

Methods improve model robustness with noisy data.

Abstract

Sample re-weighting strategies provide a promising mechanism to deal with imperfect training data in machine learning, such as noisily labeled or class-imbalanced data. One such strategy involves formulating a bi-level optimization problem called the meta re-weighting problem, whose goal is to optimize performance on a small set of perfect pivotal samples, called meta samples. Many approaches have been proposed to efficiently solve this problem. However, all of them assume that a perfect meta sample set is already provided while we observe that the selections of meta sample set is performance critical. In this paper, we study how to learn to identify such a meta sample set from a large, imperfect training set, that is subsequently cleaned and used to optimize performance in the meta re-weighting setting. We propose a learning framework which reduces the meta samples selection problem to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thuwuyinjun/meta_sample_selections
pytorchOfficial

Videos

Learning to Select Pivotal Samples for Meta Re-weighting· underline

Taxonomy

TopicsMachine Learning and Data Classification · Face and Expression Recognition · Machine Learning and ELM

Methodsk-Means Clustering