The Expressivity and Training of Deep Neural Networks: toward the Edge   of Chaos?

Gege Zhang; Gangwei Li; Ningwei Shen; Weidong Zhang

arXiv:1910.04970·cs.LG·December 24, 2019

The Expressivity and Training of Deep Neural Networks: toward the Edge of Chaos?

Gege Zhang, Gangwei Li, Ningwei Shen, Weidong Zhang

PDF

Open Access

TL;DR

This paper analyzes the expressivity of deep neural networks using a dynamic model and Hilbert space, revealing their evolution toward the edge of chaos with depth, and proposes a new activation function for improved spatial representation.

Contribution

It introduces a quantitative framework for neural network expressivity, analyzes the impact of activation functions and input perturbations, and proposes a Hermite polynomial-based activation for better information transfer.

Findings

01

DNNs tend to evolve toward the edge of chaos as depth increases.

02

The proposed Hermite polynomial-based activation improves spatial representation.

03

Empirical results confirm the theoretical analysis on time series prediction and image classification.

Abstract

Expressivity is one of the most significant issues in assessing neural networks. In this paper, we provide a quantitative analysis of the expressivity for the deep neural network (DNN) from its dynamic model, where the Hilbert space is employed to analyze the convergence and criticality. We study the feature mapping of several widely used activation functions obtained by Hermite polynomials, and find sharp declines or even saddle points in the feature space, which stagnate the information transfer in DNNs. We then present a new activation function design based on the Hermite polynomials for better utilization of spatial representation. Moreover, we analyze the information transfer of DNNs, emphasizing the convergence problem caused by the mismatch between input and topological structure. We also study the effects of input perturbations and regularization operators on critical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Neural dynamics and brain function

MethodsHermite Polynomial Activation