Porcupine Neural Networks: (Almost) All Local Optima are Global

Soheil Feizi; Hamid Javadi; Jesse Zhang; David Tse

arXiv:1710.02196 · stat.ML · October 9, 2017 · 26 cites

TL;DR

This paper introduces Porcupine Neural Networks (PNNs), two-layer networks whose weight vectors are constrained to lie on a finite set of lines. Under this constraint most local optima are global, and PNNs can still approximate unconstrained networks well.

Contribution

The paper proposes a constrained neural network model whose optimization landscape has favorable properties, and shows that the model can approximate standard unconstrained networks.

Findings

01 Most local optima of PNN optimizations are global.

02 The regions where bad local optima may exist are characterized.

03 Theoretical and empirical results suggest that a polynomially large PNN can approximate an unconstrained network.

Abstract

Neural networks have been used prominently in several machine learning and statistics applications. In general, the underlying optimization of neural networks is non-convex, which makes their performance analysis challenging. In this paper, we take a novel approach to this problem by asking whether one can constrain a neural network's weights so that its optimization landscape has good theoretical properties while the constrained network remains a good approximation of the unconstrained one. For two-layer neural networks, we provide affirmative answers to these questions by introducing Porcupine Neural Networks (PNNs), whose weight vectors are constrained to lie over a finite set of lines. We show that most local optima of PNN optimizations are global, and we characterize the regions where bad local optimizers may exist. Moreover, our theoretical and empirical results suggest that an…
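
To make the line constraint in the abstract concrete, the following NumPy sketch parameterizes each hidden weight vector as a scalar multiple of one direction from a fixed, finite set of unit vectors. It is only an illustration under stated assumptions: the ReLU activation, the second-layer weights fixed to one, the random choice of lines, and all function names (`make_lines`, `pnn_weights`, `pnn_forward`, `project_to_lines`) are hypothetical and are not taken from the paper's repository.

```python
import numpy as np


def make_lines(num_lines, dim, seed=0):
    """Return `num_lines` random unit vectors, one per line through the origin.

    Assumption: the lines are drawn at random; any finite set would do.
    """
    rng = np.random.default_rng(seed)
    u = rng.standard_normal((num_lines, dim))
    return u / np.linalg.norm(u, axis=1, keepdims=True)


def pnn_weights(scales, line_ids, lines):
    """Build hidden weights w_i = scales[i] * lines[line_ids[i]].

    Only `scales` would be trained; the directions stay fixed.
    """
    return scales[:, None] * lines[line_ids]


def pnn_forward(x, scales, line_ids, lines):
    """Two-layer forward pass: sum_i relu(<w_i, x>).

    Assumption: ReLU units and second-layer weights fixed to 1.
    """
    w = pnn_weights(scales, line_ids, lines)      # shape (k, d)
    return np.maximum(x @ w.T, 0.0).sum(axis=1)   # shape (n,)


def project_to_lines(w, lines):
    """Map unconstrained weights onto the nearest line (orthogonal projection).

    For unit direction u, the closest point to w on the line is <w, u> u,
    so the nearest line is the one with the largest |<w, u>|.
    """
    dots = w @ lines.T                            # shape (k, num_lines)
    line_ids = np.abs(dots).argmax(axis=1)
    scales = dots[np.arange(len(w)), line_ids]
    return scales, line_ids


if __name__ == "__main__":
    lines = make_lines(num_lines=4, dim=3)
    rng = np.random.default_rng(1)
    w_free = rng.standard_normal((5, 3))          # unconstrained hidden weights
    scales, line_ids = project_to_lines(w_free, lines)
    x = rng.standard_normal((10, 3))
    print(pnn_forward(x, scales, line_ids, lines).shape)
```

In this parameterization the optimization variables are the scalars in `scales`, while `project_to_lines` shows one simple way to obtain a constrained network from an unconstrained one; how closely the result matches the original depends on how many lines are available.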

Code & Models

Repositories

jessemzhang/porcupine_neural_networks
tf

Taxonomy

Topics: Neural Networks and Applications · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques