Minimax Lower Bounds for Ridge Combinations Including Neural Nets

Jason M. Klusowski; Andrew R. Barron

arXiv:1702.02828 · stat.ML · February 10, 2017 · 1 cites

Open Access

TL;DR

This paper establishes minimax lower bounds for estimating functions using ridge combinations, including neural networks, showing how error rates depend on dimension, sample size, and parameter constraints through information-theoretic analysis.

Contribution

It provides the first minimax lower bounds for ridge combination models, including neural networks, with detailed dependence on dimension, sample size, and parameter norms.

Findings

01

Error rate is of order $d/n$ raised to a fractional power when $d$ is of smaller order than $n$

02

Error rate is of order $(\log d)/n$ raised to a fractional power when $d$ is of larger order than $n$

03

Bounds depend on the constraints $v_{0}$ and $v_{1}$ on the $\ell_{1}$ norms of the parameters

Abstract

Estimation of functions of $d$ variables is considered using ridge combinations of the form $\sum_{k=1}^{m} c_{1,k} \phi\left(\sum_{j=1}^{d} c_{0,j,k} x_{j} - b_{k}\right)$, where the activation function $\phi$ is a function with bounded value and derivative. These include single-hidden layer neural networks, polynomials, and sinusoidal models. From a sample of size $n$ of possibly noisy values at random sites $X \in B = [-1, 1]^{d}$, the minimax mean square error is examined for functions in the closure of the $\ell_{1}$ hull of ridge functions with activation $\phi$. It is shown to be of order $d/n$ to a fractional power (when $d$ is of smaller order than $n$), and to be of order $(\log d)/n$ to a fractional power (when $d$ is of larger order than $n$). Dependence on constraints $v_{0}$ and $v_{1}$ on the $\ell_{1}$ norms of inner parameter $c_{0}$ and outer…
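To make the ridge combination in the abstract concrete, here is a minimal NumPy sketch that evaluates $\sum_{k} c_{1,k} \phi(\sum_{j} c_{0,j,k} x_{j} - b_{k})$ at a batch of points in $[-1, 1]^{d}$. The function name `ridge_combination`, the array layout, the random parameter values, and the choice of `tanh` as the bounded activation are all assumptions made for illustration; none of them come from the paper.

```python
import numpy as np

def ridge_combination(x, c0, b, c1, phi=np.tanh):
    """Evaluate sum_k c1[k] * phi(sum_j c0[j, k] * x[j] - b[k]) per row of x.

    x  : (n, d) input points
    c0 : (d, m) inner weights, one column per ridge function
    b  : (m,)   offsets
    c1 : (m,)   outer weights
    phi: bounded activation (tanh is an illustrative assumption)
    """
    x = np.atleast_2d(x)       # ensure shape (n, d)
    inner = x @ c0 - b         # shape (n, m): one ridge argument per unit
    return phi(inner) @ c1     # shape (n,): weighted sum over the m units

# Hypothetical sizes and parameters, chosen only to exercise the function.
rng = np.random.default_rng(0)
d, m, n = 5, 3, 4
x = rng.uniform(-1.0, 1.0, size=(n, d))   # random sites in B = [-1, 1]^d
c0 = rng.normal(size=(d, m))
b = rng.normal(size=m)
c1 = rng.normal(size=m)

values = ridge_combination(x, c0, b, c1)
```

Because `tanh` is bounded by 1 in absolute value, each output is bounded in magnitude by the $\ell_{1}$ norm of the outer weights `c1`, which is the sense in which a constraint on that norm controls the size of the function.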

Taxonomy

Topics: Neural Networks and Applications · Fuzzy Logic and Control Systems · Face and Expression Recognition