How Many Neurons Does it Take to Approximate the Maximum?

Itay Safran; Daniel Reichman; Paul Valiant

arXiv:2307.09212·cs.LG·November 8, 2023

How Many Neurons Does it Take to Approximate the Maximum?

Itay Safran, Daniel Reichman, Paul Valiant

PDF

Open Access

TL;DR

This paper investigates the minimal size of ReLU neural networks needed to approximate the maximum function over multiple inputs, establishing new bounds and depth separations, with implications for understanding neural network capacity.

Contribution

It provides new lower and upper bounds on network width for maximum function approximation, including depth separation results and a construction with logarithmic depth.

Findings

01

Depth 2 networks require exponential size to approximate the maximum.

02

Depth 3 networks can approximate the maximum with a polynomial number of neurons.

03

New lower bounds for depth 2 networks under exponential weight assumptions.

Abstract

We study the size of a neural network needed to approximate the maximum function over $d$ inputs, in the most basic setting of approximating with respect to the $L_{2}$ norm, for continuous distributions, for a network that uses ReLU activations. We provide new lower and upper bounds on the width required for approximation across various depths. Our results establish new depth separations between depth 2 and 3, and depth 3 and 5 networks, as well as providing a depth $O (lo g (lo g (d)))$ and width $O (d)$ construction which approximates the maximum function. Our depth separation results are facilitated by a new lower bound for depth 2 networks approximating the maximum function over the uniform distribution, assuming an exponential upper bound on the size of the weights. Furthermore, we are able to use this depth 2 lower bound to provide tight bounds on the number of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques

MethodsBalanced Selection