Algorithms for solving optimization problems arising from deep neural net models: smooth problems

Vyacheslav Kungurtsev; Tomas Pevny

arXiv:1807.00172 · math.OC · July 3, 2018 · 5 cites

Open Access

TL;DR

This paper discusses optimization algorithms for complex nonlinear problems in deep neural networks, highlighting a Newton-based method with negative curvature directions and demonstrating promising results in security anomaly detection.

Contribution

It makes the case for a Newton-based optimization approach incorporating directions of negative curvature for deep neural network training problems.

Findings

01 Effective on security anomaly detection datasets

02 Shows promising numerical results

03 Addresses challenges of nonlinear optimization in deep learning

Abstract

Machine learning models incorporating multiple layered learning networks have been seen to provide effective models for various classification problems. The resulting optimization problem of finding the vector that minimizes the empirical risk is, however, highly nonlinear. This presents a challenge to the application and development of appropriate optimization algorithms for solving the problem. In this paper, we summarize the primary challenges involved and present the case for a Newton-based method incorporating directions of negative curvature, including promising numerical results on data arising from security anomaly detection.
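
To illustrate what "a Newton-based method incorporating directions of negative curvature" can mean in general, here is a minimal sketch on a two-variable toy objective. It is not the authors' algorithm: the function names (`smallest_eigenpair`, `choose_direction`, `toy_grad`, `toy_hess`), the tolerance `tol`, and the toy objective f(x, y) = x^2 - y^2 + y^4/4 are all assumptions made for illustration. The idea shown is the standard one: take the Newton step when the Hessian is positive definite, and otherwise move along an eigenvector associated with the most negative eigenvalue.

```python
import math


def smallest_eigenpair(a, b, c):
    """Smallest eigenvalue and a unit eigenvector of the symmetric matrix [[a, b], [b, c]]."""
    mean = 0.5 * (a + c)
    radius = math.hypot(0.5 * (a - c), b)
    lam = mean - radius
    if b != 0.0:
        # (lam - c, b) solves (H - lam*I) v = 0 for a symmetric 2x2 matrix.
        vx, vy = lam - c, b
    else:
        # Diagonal matrix: the eigenvector is the axis holding the smaller entry.
        vx, vy = (1.0, 0.0) if a <= c else (0.0, 1.0)
    norm = math.hypot(vx, vy)
    return lam, (vx / norm, vy / norm)


def choose_direction(grad, hess, tol=1e-8):
    """Return ("newton", d) if the Hessian is positive definite, else ("negative_curvature", d).

    grad is (gx, gy); hess is (a, b, c) for the matrix [[a, b], [b, c]].
    Illustrative rule only (an assumption, not the paper's method).
    """
    gx, gy = grad
    a, b, c = hess
    lam, (vx, vy) = smallest_eigenpair(a, b, c)
    if lam > tol:
        # Newton step d = -H^{-1} g via the closed-form 2x2 inverse.
        det = a * c - b * b
        return "newton", (-(c * gx - b * gy) / det, -(a * gy - b * gx) / det)
    # Orient the negative curvature direction so it is not an ascent direction.
    if gx * vx + gy * vy > 0.0:
        vx, vy = -vx, -vy
    return "negative_curvature", (vx, vy)


def toy_grad(x, y):
    """Gradient of the toy objective f(x, y) = x^2 - y^2 + y^4 / 4."""
    return (2.0 * x, -2.0 * y + y ** 3)


def toy_hess(x, y):
    """Hessian of the toy objective, returned as (a, b, c)."""
    return (2.0, 0.0, -2.0 + 3.0 * y * y)
```

At the saddle point (0, 0) the gradient is zero, so a pure gradient method would stall, while `choose_direction(toy_grad(0.0, 0.0), toy_hess(0.0, 0.0))` returns the negative curvature direction `(0.0, 1.0)`, along which the toy objective decreases. At a point where the Hessian is positive definite, such as (0, 2), the same call falls back to the ordinary Newton step.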

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Numerical Analysis Techniques