Universal approximation properties of shallow quadratic neural networks

Leon Frischauf; Otmar Scherzer; Cong Shi

arXiv:2110.01536·math.NA·May 12, 2022

TL;DR

This paper shows that shallow neural networks built from quadratic rather than affine linear neurons are universal approximators and that, in simple test cases, they need fewer neurons than standard affine linear networks; it also investigates their efficiency for clustering tasks on the MNIST data set.

Contribution

It proves that shallow quadratic neural networks are universal approximators, establishes convergence rate results, and compares their efficiency with that of standard affine linear networks in approximation and clustering tasks.

Findings

01 In simple test cases, quadratic neural networks require fewer neurons than standard affine linear networks.

02 Their efficiency for clustering tasks is investigated on the MNIST data set.

03 Convergence rate results are proved using the theory of wavelets and statistical learning.

Abstract

In this paper we study shallow neural network functions which are linear combinations of compositions of activation and quadratic functions, replacing standard affine linear functions, often called neurons. We show the universality of this approximation and prove convergence rate results based on the theory of wavelets and statistical learning. We show for simple test cases that this ansatz requires a smaller number of neurons than standard affine linear neural networks. Moreover, we investigate the efficiency of this approach for clustering tasks with the MNIST data set. Similar observations are made when comparing deep (multi-layer) networks.
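
The network functions described in the abstract can be illustrated with a minimal sketch. The abstract only states that each neuron composes an activation with a quadratic function, so the general quadratic form x^T A_j x + w_j^T x + theta_j used here, the sigmoid activation, and all names (`shallow_quadratic_network`, `alpha`, `A`, `w`, `theta`) are assumptions made for illustration and not necessarily the parameterization studied in the paper.

```python
import numpy as np


def sigmoid(z):
    """Logistic activation; any other activation could be passed instead."""
    return 1.0 / (1.0 + np.exp(-z))


def shallow_quadratic_network(x, alpha, A, w, theta, activation=sigmoid):
    """Evaluate sum_j alpha[j] * activation(x^T A[j] x + w[j]^T x + theta[j]).

    Assumed shapes (hypothetical parameterization):
      x: (n,), alpha: (N,), A: (N, n, n), w: (N, n), theta: (N,)
    """
    quad = np.einsum("i,jik,k->j", x, A, x)  # x^T A_j x for every neuron j
    lin = w @ x                              # w_j^T x for every neuron j
    return float(alpha @ activation(quad + lin + theta))


def shallow_affine_network(x, alpha, w, theta, activation=sigmoid):
    """Standard shallow network: the same formula without the quadratic term."""
    return float(alpha @ activation(w @ x + theta))


# One neuron in two dimensions whose level sets are circles around the origin.
alpha = np.array([1.0])
A = np.eye(2)[None, :, :]   # shape (1, 2, 2)
w = np.zeros((1, 2))
theta = np.zeros(1)

value_at_origin = shallow_quadratic_network(np.zeros(2), alpha, A, w, theta)
```

Setting every `A[j]` to the zero matrix recovers the standard affine linear network, which is why the quadratic ansatz is at least as expressive per neuron under this assumed parameterization.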

Taxonomy

Topics: Neural Networks and Applications · Image and Signal Denoising Methods · Blind Source Separation Techniques