Learning Data Manipulation for Augmentation and Weighting

Zhiting Hu; Bowen Tan; Ruslan Salakhutdinov; Tom Mitchell; Eric P.; Xing

arXiv:1910.12795·cs.LG·October 29, 2019·71 cites

Learning Data Manipulation for Augmentation and Weighting

Zhiting Hu, Bowen Tan, Ruslan Salakhutdinov, Tom Mitchell, Eric P., Xing

PDF

Open Access 2 Repos

TL;DR

This paper introduces a unified gradient-based method for learning various data manipulation techniques, such as augmentation and weighting, by leveraging reinforcement learning principles to improve classification tasks especially in low-data and imbalanced scenarios.

Contribution

It proposes a novel, flexible approach that learns different data manipulation schemes using a single gradient-based algorithm, connecting supervised learning with reinforcement learning.

Findings

01

Significant improvements in image and text classification accuracy.

02

Effective data augmentation via learned text transformation networks.

03

Enhanced performance in low-data and class-imbalance settings.

Abstract

Manipulating data, such as weighting data examples or augmenting with new instances, has been increasingly used to improve model training. Previous work has studied various rule- or learning-based approaches designed for specific types of data manipulation. In this work, we propose a new method that supports learning different manipulation schemes with the same gradient-based algorithm. Our approach builds upon a recent connection of supervised learning and reinforcement learning (RL), and adapts an off-the-shelf reward learning algorithm from RL for joint data manipulation learning and model training. Different parameterization of the "data reward" function instantiates different manipulation schemes. We showcase data augmentation that learns a text transformation network, and data weighting that dynamically adapts the data sample importance. Experiments show the resulting algorithms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Imbalanced Data Classification Techniques · Anomaly Detection Techniques and Applications