Successive Affine Learning for Deep Neural Networks

Yuesheng Xu

arXiv:2305.07996 · cs.LG · July 12, 2023 · 2 cites

Open Access

TL;DR

This paper proposes a successive affine learning (SAL) model for deep neural networks that trains the network layer by layer, solving a convex problem for each layer instead of a single non-convex problem. In the paper's numerical tests, SAL outperforms traditional deep learning models.

Contribution

The SAL model introduces a layer-wise convex optimization approach for training DNNs, inspired by human education, and provides theoretical convergence guarantees.

Findings

01. SAL outperforms traditional deep learning models in numerical tests.

02. The model establishes Pythagorean and Parseval identities for the generated system.

03. A convergence theorem shows that the method either terminates after finitely many steps or its error norms decrease to a limit.

Abstract

This paper introduces a successive affine learning (SAL) model for constructing deep neural networks (DNNs). Traditionally, a DNN is built by solving a non-convex optimization problem. Such a problem is often challenging to solve numerically because of its non-convexity and the large number of layers. To address this challenge, inspired by the human education system, the multi-grade deep learning (MGDL) model was recently initiated by the author of this paper. The MGDL model learns a DNN in several grades, in each of which one constructs a shallow DNN consisting of a relatively small number of layers. The MGDL model still requires solving several non-convex optimization problems. The proposed SAL model is derived from the MGDL model. Noting that each layer of a DNN consists of an affine map followed by an activation function, we propose to learn the affine map by solving a…
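The layer-wise idea in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract is truncated before it states the exact optimization problem, so the sketch assumes that each affine map is fit to the current residual by ordinary least squares (a convex quadratic problem), that the activation is ReLU, and that the activated output becomes the input features of the next layer. The names `fit_affine` and `successive_fit` are hypothetical.

```python
import numpy as np


def relu(z):
    """Elementwise ReLU activation (an assumed choice of activation)."""
    return np.maximum(z, 0.0)


def fit_affine(features, target):
    """Fit target ~ features @ W + b by least squares, a convex quadratic problem."""
    ones = np.ones((features.shape[0], 1))
    design = np.hstack([features, ones])  # the extra column carries the bias
    params, *_ = np.linalg.lstsq(design, target, rcond=None)
    return params[:-1], params[-1]  # weight matrix W, bias vector b


def successive_fit(x, y, num_layers=4):
    """Assumed layer-wise scheme: fit an affine map to the residual, then activate."""
    features = x
    residual = y.copy()
    prediction = np.zeros_like(y)
    error_norms = [np.linalg.norm(residual)]
    for _ in range(num_layers):
        W, b = fit_affine(features, residual)  # convex step, no activation involved
        affine_out = features @ W + b
        prediction = prediction + affine_out
        residual = residual - affine_out
        error_norms.append(np.linalg.norm(residual))
        features = relu(affine_out)  # activation applied only after the fit
    return prediction, error_norms
```

Calling `successive_fit(x, y)` with an input array `x` of shape `(n, d)` and a target array `y` of shape `(n, m)` returns the accumulated prediction and the residual norm after each layer. Because the zero map is always a feasible choice in each least-squares step, the residual norms in this sketch cannot increase from one layer to the next, which is in the spirit of the convergence statement in the findings.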

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM