Exponential Separations in Symmetric Neural Networks

Aaron Zweig; Joan Bruna

arXiv:2206.01266·cs.LG·December 13, 2022·1 cites

Exponential Separations in Symmetric Neural Networks

Aaron Zweig, Joan Bruna

PDF

Open Access 1 Video

TL;DR

This paper proves that certain symmetric neural network architectures can efficiently approximate specific functions, while others require exponentially larger widths, highlighting fundamental representational differences.

Contribution

It introduces a novel exponential separation result between Relational Networks and DeepSets for symmetric function approximation.

Findings

01

Relational Networks can efficiently approximate certain symmetric functions.

02

DeepSets require exponential width to approximate the same functions.

03

The separation holds under analytic activation functions.

Abstract

In this work we demonstrate a novel separation between symmetric neural network architectures. Specifically, we consider the Relational Network~\parencite{santoro2017simple} architecture as a natural generalization of the DeepSets~\parencite{zaheer2017deep} architecture, and study their representational gap. Under the restriction to analytic activation functions, we construct a symmetric function acting on sets of size $N$ with elements in dimension $D$ , which can be efficiently approximated by the former architecture, but provably requires width exponential in $N$ and $D$ for the latter.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Exponential Separations in Symmetric Neural Networks· slideslive

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and ELM · Advanced Neural Network Applications