Neural networks with linear threshold activations: structure and   algorithms

Sammy Khalife; Hongyu Cheng; Amitabh Basu

arXiv:2111.08117·cs.LG·October 20, 2023

Neural networks with linear threshold activations: structure and algorithms

Sammy Khalife, Hongyu Cheng, Amitabh Basu

PDF

Open Access

TL;DR

This paper characterizes the functions representable by neural networks with linear threshold activations, establishes the necessity of two hidden layers, and introduces a polynomial-time algorithm for global optimization, along with a new neural network class.

Contribution

It provides a precise characterization of linear threshold neural networks, proves the necessity of two hidden layers, and develops a universal polynomial-time algorithm for empirical risk minimization.

Findings

01

Two hidden layers are necessary and sufficient for representing certain functions.

02

A polynomial-time algorithm for global ERM optimization in fixed architecture neural networks.

03

Introduction of shortcut linear threshold networks with desirable theoretical properties.

Abstract

In this article we present new results on neural networks with linear threshold activation functions. We precisely characterize the class of functions that are representable by such neural networks and show that 2 hidden layers are necessary and sufficient to represent any function representable in the class. This is a surprising result in the light of recent exact representability investigations for neural networks using other popular activation functions like rectified linear units (ReLU). We also give precise bounds on the sizes of the neural networks required to represent any function in the class. Finally, we design an algorithm to solve the empirical risk minimization (ERM) problem to global optimality for these neural networks with a fixed architecture. The algorithm's running time is polynomial in the size of the data sample, if the input dimension and the size of the network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Algorithms · Fault Detection and Control Systems