Learning What Data to Learn

Yang Fan; Fei Tian; Tao Qin; Jiang Bian; Tie-Yan Liu

arXiv:1702.08635·cs.LG·March 1, 2017·55 cites

Learning What Data to Learn

Yang Fan, Fei Tian, Tao Qin, Jiang Bian, Tie-Yan Liu

PDF

Open Access

TL;DR

This paper introduces NDF, a deep reinforcement learning framework that adaptively selects training data to improve learning efficiency and achieve comparable accuracy with less data across various neural network models and tasks.

Contribution

The paper presents a novel, generic deep reinforcement learning approach for automatic data selection during training, outperforming heuristic methods and applicable to multiple neural network architectures.

Findings

01

NDF achieves similar accuracy with less data and fewer iterations.

02

NDF is effective across different neural network types and tasks.

03

The approach improves training efficiency in neural network learning.

Abstract

Machine learning is essentially the sciences of playing with data. An adaptive data selection strategy, enabling to dynamically choose different data at various training stages, can reach a more effective model in a more efficient way. In this paper, we propose a deep reinforcement learning framework, which we call \emph{\textbf{N}eural \textbf{D}ata \textbf{F}ilter} (\textbf{NDF}), to explore automatic and adaptive data selection in the training process. In particular, NDF takes advantage of a deep neural network to adaptively select and filter important data instances from a sequential stream of training data, such that the future accumulative reward (e.g., the convergence speed) is maximized. In contrast to previous studies in data selection that is mainly based on heuristic strategies, NDF is quite generic and thus can be widely suitable for many machine learning tasks. Taking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Advanced Neural Network Applications

MethodsStochastic Gradient Descent