Neural Networks with Small Weights and Depth-Separation Barriers

Gal Vardi; Ohad Shamir

arXiv:2006.00625·cs.LG·December 29, 2020·5 cites

Neural Networks with Small Weights and Depth-Separation Barriers

Gal Vardi, Ohad Shamir

PDF

Open Access 1 Video

TL;DR

This paper investigates the limitations of neural network expressiveness related to depth and weight bounds, establishing barriers to proving depth separation results beyond depth 4 and linking weight complexity to network depth.

Contribution

It introduces fundamental barriers to depth separation proofs beyond depth 4 and shows that functions requiring large weights can be approximated by networks with polynomially bounded weights at increased depth.

Findings

01

Depth separation results are limited beyond depth 4 due to circuit complexity barriers.

02

Functions with large weights can be approximated by shallower networks with polynomially bounded weights.

03

The paper connects neural network expressiveness to classical circuit complexity problems.

Abstract

In studying the expressiveness of neural networks, an important question is whether there are functions which can only be approximated by sufficiently deep networks, assuming their size is bounded. However, for constant depths, existing results are limited to depths $2$ and $3$ , and achieving results for higher depths has been an important open question. In this paper, we focus on feedforward ReLU networks, and prove fundamental barriers to proving such results beyond depth $4$ , by reduction to open problems and natural-proof barriers in circuit complexity. To show this, we study a seemingly unrelated problem of independent interest: Namely, whether there are polynomially-bounded functions which require super-polynomial weights in order to approximate with constant-depth neural networks. We provide a negative and constructive answer to that question, by showing that if a function can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Neural Networks with Small Weights and Depth-Separation Barriers· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Advanced Graph Neural Networks

Methods*Communicated@Fast*How Do I Communicate to Expedia?