Improved Learning of One-hidden-layer Convolutional Neural Networks with   Overlaps

Simon S. Du; Surbhi Goel

arXiv:1805.07798·cs.LG·June 5, 2018·6 cites

Improved Learning of One-hidden-layer Convolutional Neural Networks with Overlaps

Simon S. Du, Surbhi Goel

PDF

Open Access

TL;DR

This paper introduces a new algorithm for efficiently learning one-hidden-layer convolutional neural networks with overlapping patches, applicable to common computer vision structures, combining isotonic regression and landscape analysis techniques.

Contribution

It presents a novel algorithm that handles overlapping patches in CNNs, advancing provable learning methods for neural networks with complex structures.

Findings

01

Algorithm effectively learns CNNs with overlaps

02

Applicable to general patch structures in vision tasks

03

Provides theoretical insights into non-convex optimization landscapes

Abstract

We propose a new algorithm to learn a one-hidden-layer convolutional neural network where both the convolutional weights and the outputs weights are parameters to be learned. Our algorithm works for a general class of (potentially overlapping) patches, including commonly used structures for computer vision tasks. Our algorithm draws ideas from (1) isotonic regression for learning neural networks and (2) landscape analysis of non-convex matrix factorization problems. We believe these findings may inspire further development in designing provable algorithms for learning neural networks and other complex models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques