Learning What Data to Learn
Yang Fan, Fei Tian, Tao Qin, Jiang Bian, Tie-Yan Liu

TL;DR
This paper introduces NDF, a deep reinforcement learning framework that adaptively selects training data to improve learning efficiency and achieve comparable accuracy with less data across various neural network models and tasks.
Contribution
The paper presents a novel, generic deep reinforcement learning approach for automatic data selection during training, outperforming heuristic methods and applicable to multiple neural network architectures.
Findings
NDF achieves similar accuracy with less data and fewer iterations.
NDF is effective across different neural network types and tasks.
The approach improves training efficiency in neural network learning.
Abstract
Machine learning is essentially the sciences of playing with data. An adaptive data selection strategy, enabling to dynamically choose different data at various training stages, can reach a more effective model in a more efficient way. In this paper, we propose a deep reinforcement learning framework, which we call \emph{\textbf{N}eural \textbf{D}ata \textbf{F}ilter} (\textbf{NDF}), to explore automatic and adaptive data selection in the training process. In particular, NDF takes advantage of a deep neural network to adaptively select and filter important data instances from a sequential stream of training data, such that the future accumulative reward (e.g., the convergence speed) is maximized. In contrast to previous studies in data selection that is mainly based on heuristic strategies, NDF is quite generic and thus can be widely suitable for many machine learning tasks. Taking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Advanced Neural Network Applications
MethodsStochastic Gradient Descent
