Multi-Grade Deep Learning

Yuesheng Xu

arXiv:2302.00150·cs.LG·February 2, 2023·1 cites

Multi-Grade Deep Learning

Yuesheng Xu

PDF

Open Access

TL;DR

This paper introduces a multi-grade learning approach inspired by human education, which decomposes deep neural network training into successive shallow network optimizations, improving efficiency and robustness.

Contribution

The paper proposes a novel multi-grade learning framework that divides deep network training into smaller, manageable optimization problems, reducing nonconvexity and enhancing effectiveness.

Findings

01

Outperforms traditional single-grade models in experiments

02

Reduces training complexity for deep neural networks

03

Enhances robustness of the learning process

Abstract

The current deep learning model is of a single-grade, that is, it learns a deep neural network by solving a single nonconvex optimization problem. When the layer number of the neural network is large, it is computationally challenging to carry out such a task efficiently. Inspired by the human education process which arranges learning in grades, we propose a multi-grade learning model: We successively solve a number of optimization problems of small sizes, which are organized in grades, to learn a shallow neural network for each grade. Specifically, the current grade is to learn the leftover from the previous grade. In each of the grades, we learn a shallow neural network stacked on the top of the neural network, learned in the previous grades, which remains unchanged in training of the current and future grades. By dividing the task of learning a deep neural network into learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques