Theoretical Exploration of Solutions of Feedforward ReLU Networks

Changcun Huang

arXiv:2202.01919·cs.LG·November 15, 2022

Open Access

TL;DR

This paper provides a theoretical framework for understanding feedforward ReLU networks by constructing their solutions for piecewise linear functions, and uses those solutions to interpret architecture components, overparameterization, and the advantages of depth.
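
To make the piecewise linear view concrete, here is a minimal Python sketch. It is not the paper's construction. It relies only on the standard fact that a continuous piecewise linear function of one variable can be written exactly as a one-hidden-layer ReLU network, with one hidden unit per interior breakpoint. The names `build_relu_net` and `evaluate` and the sample knots are hypothetical, chosen only for illustration.

```python
def relu(z):
    """Rectified linear unit."""
    return max(0.0, z)


def build_relu_net(knots, values):
    """Build a one-hidden-layer ReLU net interpolating (knots[i], values[i]).

    Hypothetical helper for illustration. Returns (output_bias, units), where
    each unit is (w, b, c) and the net computes
    output_bias + sum(c * relu(w * x + b)).
    Assumes knots are strictly increasing and len(knots) >= 2.
    """
    slopes = [
        (values[i + 1] - values[i]) / (knots[i + 1] - knots[i])
        for i in range(len(knots) - 1)
    ]
    x0, y0 = knots[0], values[0]
    # Two units reproduce the first linear piece on the whole real line:
    # s0 * (x - x0) = s0 * relu(x - x0) - s0 * relu(x0 - x).
    units = [(1.0, -x0, slopes[0]), (-1.0, x0, -slopes[0])]
    # One unit per interior knot; its output weight is the change in slope.
    for i in range(1, len(slopes)):
        units.append((1.0, -knots[i], slopes[i] - slopes[i - 1]))
    return y0, units


def evaluate(net, x):
    """Forward pass of the network returned by build_relu_net."""
    bias, units = net
    return bias + sum(c * relu(w * x + b) for w, b, c in units)


# Illustrative data: a zigzag through four sample points.
knots = [0.0, 1.0, 2.0, 3.0]
values = [0.0, 1.0, 0.0, 2.0]
net = build_relu_net(knots, values)
print([evaluate(net, x) for x in knots])
```

Each hidden unit switches on at one breakpoint and adds the change in slope there, so the output is affine on every interval between breakpoints. Outside the sampled range the network continues with the first and last slopes.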

Contribution

The paper introduces a universal solution framework for ReLU networks, grounded in affine geometry, and uses it to explain architecture components, parameter sharing across multiple outputs, and the benefits of depth.

Findings

01. Solutions for three-layer and deep networks are derived.

02. Interpretations of each network component are provided.

03. Overparameterization is explained in terms of affine transforms.

Abstract

This paper aims to interpret the mechanism of feedforward ReLU networks by exploring their solutions for piecewise linear functions, deduced from basic rules. The constructed solution should be universal enough to explain network architectures used in engineering; to that end, several ways of enhancing the universality of the solution are provided. Consequences of our theory include: against an affine-geometry background, the solutions of both three-layer and deep networks are given, particularly for architectures applied in practice, such as multilayer feedforward neural networks and decoders; we give clear and intuitive interpretations of each component of network architectures; the parameter-sharing mechanism for multiple outputs is investigated; we explain overparameterization solutions in terms of affine transforms; Under our…

Taxonomy

Topics: Neural Networks and Applications