An Algorithm for Training Polynomial Networks

Roi Livni; Shai Shalev-Shwartz; Ohad Shamir

arXiv:1304.7045·cs.LG·February 21, 2014·34 cites

An Algorithm for Training Polynomial Networks

Roi Livni, Shai Shalev-Shwartz, Ohad Shamir

PDF

Open Access

TL;DR

This paper introduces the Basis Learner, an efficient layer-by-layer algorithm for training deep polynomial networks with quadratic nodes, capable of reducing training error to zero and outperforming shallow polynomial models.

Contribution

The paper presents a novel universal training algorithm for deep quadratic networks, with practical implementations and experimental validation.

Findings

01

Training error decreases at each iteration

02

Algorithm can reach zero training error under mild conditions

03

Deep polynomial networks outperform shallow models in experiments

Abstract

We consider deep neural networks, in which the output of each node is a quadratic function of its inputs. Similar to other deep architectures, these networks can compactly represent any function on a finite training set. The main goal of this paper is the derivation of an efficient layer-by-layer algorithm for training such networks, which we denote as the \emph{Basis Learner}. The algorithm is a universal learner in the sense that the training error is guaranteed to decrease at every iteration, and can eventually reach zero under mild conditions. We present practical implementations of this algorithm, as well as preliminary experimental results. We also compare our deep architecture to other shallow architectures for learning polynomials, in particular kernel learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Machine Learning and Algorithms