Residual Networks: Lyapunov Stability and Convex Decomposition

Kamil Nar; Shankar Sastry

arXiv:1803.08203 · cs.LG · March 23, 2018 · 1 citation

Open Access

TL;DR

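This paper attributes the absence of depth-related training-error degradation in residual networks to the Lyapunov stability of gradient descent, and introduces an architecture built from a pair of residual networks that decomposes a function into a convex and a concave part and whose training behavior limits overfitting.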

Contribution

It identifies the Lyapunov stability of the gradient descent algorithm as the main reason residual networks keep training well as depth grows, and proposes an architecture that approximates a large class of functions by splitting each into a convex and a concave part.

Findings

01

For an arbitrarily chosen step size, the equilibria of gradient descent are most likely to remain stable under the residual-network parametrization.

02

A pair of residual networks approximates a large class of functions by decomposing each into a convex and a concave part.

03

Some parameters change little during training; this imperfect optimization prevents overfitting and leads to solutions with small Lipschitz constants.
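
The convex decomposition named in the findings can be illustrated with a small numerical sketch. It is not the paper's architecture: the target function sin(x), the curvature bound C, and all helper names below are assumptions chosen for illustration. The sketch only checks the underlying idea that a smooth function can be written as the sum of a convex piece and a concave piece.

```python
import numpy as np

C = 1.0  # assumed curvature bound: |d^2/dx^2 sin(x)| <= 1

def f(x):
    """Illustrative target function (an assumption, not taken from the paper)."""
    return np.sin(x)

def convex_part(x):
    # Second derivative is C - sin(x) >= 0, so this piece is convex.
    return np.sin(x) + 0.5 * C * x**2

def concave_part(x):
    # Second derivative is -C < 0, so this piece is concave.
    return -0.5 * C * x**2

def second_differences(y):
    """Discrete curvature on a uniform grid: y[i-1] - 2*y[i] + y[i+1]."""
    return y[:-2] - 2.0 * y[1:-1] + y[2:]

x = np.linspace(-5.0, 5.0, 2001)
g, h = convex_part(x), concave_part(x)

# The two pieces should add back up to the target function.
reconstruction_error = float(np.max(np.abs(g + h - f(x))))
# Convex piece: discrete curvature should be non-negative (up to rounding).
min_curvature_convex = float(np.min(second_differences(g)))
# Concave piece: discrete curvature should be negative everywhere.
max_curvature_concave = float(np.max(second_differences(h)))

print("reconstruction error:", reconstruction_error)
print("min curvature of convex part:", min_curvature_convex)
print("max curvature of concave part:", max_curvature_concave)
```

Adding a large enough quadratic is only one way to obtain such a split; how the paper's pair of residual networks represents the two parts is not described on this page.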

Abstract

While training error of most deep neural networks degrades as the depth of the network increases, residual networks appear to be an exception. We show that the main reason for this is the Lyapunov stability of the gradient descent algorithm: for an arbitrarily chosen step size, the equilibria of the gradient descent are most likely to remain stable for the parametrization of residual networks. We then present an architecture with a pair of residual networks to approximate a large class of functions by decomposing them into a convex and a concave part. Some parameters of this model are shown to change little during training, and this imperfect optimization prevents overfitting the data and leads to solutions with small Lipschitz constants, while providing clues about the generalization of other deep networks.
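
The link between step size and stable equilibria described in the abstract can be sketched on a toy problem. The example below is a minimal illustration under stated assumptions, not the paper's analysis: the scalar chain loss f(w) = 0.5 * (prod(w) - r)^2, the depth, the target r, the perturbation, and all function names are hypothetical. It uses the standard linearization argument that a gradient descent equilibrium stays stable only while the step size times the largest Hessian eigenvalue is below 2.

```python
import numpy as np

def partial_products(w):
    """g_i = product of all weights except w_i, i.e. d prod(w) / d w_i."""
    return np.array([np.prod(np.delete(w, i)) for i in range(len(w))])

def grad(w, r):
    """Gradient of the toy loss f(w) = 0.5 * (prod(w) - r)**2."""
    return (np.prod(w) - r) * partial_products(w)

def critical_step(w_eq):
    """Step size above which the linearized iteration at w_eq expands.

    At an equilibrium with prod(w_eq) == r the Hessian is g g^T, whose only
    nonzero eigenvalue is ||g||^2, so the bound 2 / lambda_max is 2 / ||g||^2.
    """
    g = partial_products(w_eq)
    return 2.0 / float(g @ g)

def max_deviation(w_eq, r, step, perturbation, iters=500):
    """Run gradient descent from w_eq + perturbation.

    Returns the largest distance from w_eq seen along the trajectory, which is
    the quantity that Lyapunov stability asks to remain small.
    """
    w = w_eq + perturbation
    worst = np.linalg.norm(w - w_eq)
    for _ in range(iters):
        w = w - step * grad(w, r)
        worst = max(worst, np.linalg.norm(w - w_eq))
        if worst > 1e3:  # clearly left the neighborhood; stop before overflow
            break
    return float(worst)

depth, r = 3, 1.0
w_eq = np.ones(depth)              # balanced equilibrium: prod(w_eq) == r
delta_c = critical_step(w_eq)      # equals 2 / 3 for this toy problem
perturbation = 0.01 * np.array([1.0, -0.5, 0.3])

dev_below = max_deviation(w_eq, r, 0.9 * delta_c, perturbation)
dev_above = max_deviation(w_eq, r, 1.5 * delta_c, perturbation)

print(f"critical step size: {delta_c:.4f}")
print(f"max deviation with step below critical: {dev_below:.4f}")
print(f"max deviation with step above critical: {dev_above:.4f}")
```

With a step size below the critical value the iterates remain within roughly the size of the initial perturbation; above it they leave the neighborhood of the equilibrium. The sketch does not compare plain and residual parametrizations, which is the subject of the paper itself.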

Taxonomy

Topics: Graph theory and applications · Gene Regulatory Network Analysis · Distributed Control Multi-Agent Systems