A Novel DNN Training Framework via Data Sampling and Multi-Task Optimization

Boyu Zhang; A. K. Qin; Hong Pan; Timos Sellis

arXiv:2007.01016·cs.NE·July 3, 2020

TL;DR

This paper introduces a new DNN training framework that uses multiple data splits and multi-task optimization to improve training effectiveness and generalization, outperforming traditional methods.

Contribution

The paper proposes a novel training framework that generates multiple data splits and employs multi-task optimization for better DNN training and generalization.

Findings

1. The framework improves training effectiveness by escaping local optima.

2. It enhances generalization through implicit regularization.

3. Experimental results show superiority over conventional training methods.

Abstract

Conventional DNN training paradigms typically rely on one training set and one validation set, obtained by partitioning, in a certain way, the annotated dataset available for training, referred to as the gross training set. The training set is used to train the model, while the validation set is used to estimate the generalization performance of the model as training proceeds, so as to avoid over-fitting. This paradigm has two major issues. First, the validation set can hardly guarantee an unbiased estimate of generalization performance, because it may not match the test data. Second, training a DNN amounts to solving a complex optimization problem, which is prone to getting trapped in inferior local optima and thus leads to undesired training results. To address these issues, we propose a novel DNN training framework. It generates multiple pairs of training and validation…
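
As a rough illustration of the idea of drawing multiple training/validation pairs from one gross training set, the sketch below produces several random index splits. The abstract is cut off before it describes the sampling scheme, so the uniform random sampling used here, the function name `make_splits`, and its parameters (`n_splits`, `val_fraction`, `seed`) are all assumptions made for illustration, not the authors' method. The multi-task optimization step is not sketched.

```python
import numpy as np


def make_splits(n_samples, n_splits=4, val_fraction=0.2, seed=0):
    """Return n_splits (train_idx, val_idx) pairs over range(n_samples).

    Hypothetical helper: each pair is an independent random partition of
    the gross training set. The paper's actual sampling scheme may differ.
    """
    rng = np.random.default_rng(seed)
    n_val = int(round(n_samples * val_fraction))
    splits = []
    for _ in range(n_splits):
        perm = rng.permutation(n_samples)
        val_idx = np.sort(perm[:n_val])    # held out for validation
        train_idx = np.sort(perm[n_val:])  # used for training
        splits.append((train_idx, val_idx))
    return splits


# Example: four train/validation pairs over a gross training set of 100 items.
splits = make_splits(n_samples=100, n_splits=4, val_fraction=0.2)
```

Each pair partitions the same gross training set, so a model trained on one pair can be validated against data that another pair used for training; how the framework exploits that across tasks is not covered by the truncated abstract.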
