Data Valuation using Reinforcement Learning

Jinsung Yoon; Sercan O. Arik; Tomas Pfister

arXiv:1909.11671·cs.LG·September 27, 2019·45 cites

Data Valuation using Reinforcement Learning

Jinsung Yoon, Sercan O. Arik, Tomas Pfister

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces DVRL, a reinforcement learning-based framework for data valuation that adaptively estimates data importance to improve machine learning tasks such as domain adaptation and robust learning.

Contribution

The paper proposes a novel meta learning framework using reinforcement learning to estimate data value, outperforming existing methods across various applications.

Findings

01

DVRL provides superior data value estimates compared to alternatives.

02

Close to optimal corrupted sample discovery performance.

03

Significant improvements in domain adaptation and robust learning tasks.

Abstract

Quantifying the value of data is a fundamental problem in machine learning. Data valuation has multiple important use cases: (1) building insights about the learning task, (2) domain adaptation, (3) corrupted sample discovery, and (4) robust learning. To adaptively learn data values jointly with the target task predictor model, we propose a meta learning framework which we name Data Valuation using Reinforcement Learning (DVRL). We employ a data value estimator (modeled by a deep neural network) to learn how likely each datum is used in training of the predictor model. We train the data value estimator using a reinforcement signal of the reward obtained on a small validation set that reflects performance on the target task. We demonstrate that DVRL yields superior data value estimates compared to alternative methods across different types of datasets and in a diverse set of application…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Data Valuation using Reinforcement Learning· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)