An Analytic End-to-End Deep Learning Algorithm based on Collaborative Learning
Sitan Li, Chien Chern Cheah

TL;DR
This paper introduces an analytic convergence analysis for a smooth-activation end-to-end deep learning algorithm, demonstrating its stability, collaborative training benefits, and practical effectiveness in classification and robotic control tasks.
Contribution
It provides the first convergence analysis for smooth-activation neural networks in control, avoiding chattering and gradient issues, and proposes a collaborative learning framework for improved accuracy.
Findings
Successful classification on MNIST dataset
Effective online control of UR5e robot arm
Avoidance of chattering and gradient vanishing
Abstract
In most control applications, theoretical analysis of the systems is crucial in ensuring stability or convergence, so as to ensure safe and reliable operations and also to gain a better understanding of the systems for further developments. However, most current deep learning methods are black-box approaches that are more focused on empirical studies. Recently, some results have been obtained for convergence analysis of end-to end deep learning based on non-smooth ReLU activation functions, which may result in chattering for control tasks. This paper presents a convergence analysis for end-to-end deep learning of fully connected neural networks (FNN) with smooth activation functions. The proposed method therefore avoids any potential chattering problem, and it also does not easily lead to gradient vanishing problems. The proposed End-to-End algorithm trains multiple two-layer fully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and ELM · Brain Tumor Detection and Classification
