An Analytic End-to-End Deep Learning Algorithm based on Collaborative   Learning

Sitan Li; Chien Chern Cheah

arXiv:2305.18594·cs.LG·June 1, 2023·1 cites

An Analytic End-to-End Deep Learning Algorithm based on Collaborative Learning

Sitan Li, Chien Chern Cheah

PDF

Open Access

TL;DR

This paper introduces an analytic convergence analysis for a smooth-activation end-to-end deep learning algorithm, demonstrating its stability, collaborative training benefits, and practical effectiveness in classification and robotic control tasks.

Contribution

It provides the first convergence analysis for smooth-activation neural networks in control, avoiding chattering and gradient issues, and proposes a collaborative learning framework for improved accuracy.

Findings

01

Successful classification on MNIST dataset

02

Effective online control of UR5e robot arm

03

Avoidance of chattering and gradient vanishing

Abstract

In most control applications, theoretical analysis of the systems is crucial in ensuring stability or convergence, so as to ensure safe and reliable operations and also to gain a better understanding of the systems for further developments. However, most current deep learning methods are black-box approaches that are more focused on empirical studies. Recently, some results have been obtained for convergence analysis of end-to end deep learning based on non-smooth ReLU activation functions, which may result in chattering for control tasks. This paper presents a convergence analysis for end-to-end deep learning of fully connected neural networks (FNN) with smooth activation functions. The proposed method therefore avoids any potential chattering problem, and it also does not easily lead to gradient vanishing problems. The proposed End-to-End algorithm trains multiple two-layer fully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and ELM · Brain Tumor Detection and Classification