Stability of the Stochastic Gradient Method for an Approximated Large   Scale Kernel Machine

Aven Samareh; Mahshid Salemi Parizi

arXiv:1804.08003·eess.SP·April 24, 2018

Stability of the Stochastic Gradient Method for an Approximated Large Scale Kernel Machine

Aven Samareh, Mahshid Salemi Parizi

PDF

Open Access

TL;DR

This paper analyzes the stability and generalization performance of stochastic gradient methods when used with approximated kernel functions via random Fourier features, demonstrating theoretical stability and empirical validation.

Contribution

It provides a theoretical analysis of the stability of stochastic gradient methods for approximated kernel machines and empirically verifies the results across multiple datasets.

Findings

01

SGM is stable for approximated kernel functions under certain conditions.

02

High probability bounds on generalization error are established.

03

Empirical results confirm theoretical stability and generalization performance.

Abstract

In this paper we measured the stability of stochastic gradient method (SGM) for learning an approximated Fourier primal support vector machine. The stability of an algorithm is considered by measuring the generalization error in terms of the absolute difference between the test and the training error. Our problem is to learn an approximated kernel function using random Fourier features for a binary classification problem via online convex optimization settings. For a convex, Lipschitz continuous and smooth loss function, given reasonable number of iterations stochastic gradient method is stable. We showed that with a high probability SGM generalizes well for an approximated kernel under given assumptions.We empirically verified the theoretical findings for different parameters using several data sets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM