Constant Time EXPected Similarity Estimation using Stochastic Optimization

Markus Schneider; Wolfgang Ertel; Günther Palm

arXiv:1511.05371·cs.LG·November 18, 2015·1 cites

TL;DR

This paper introduces an improved version of the EXPected Similarity Estimation (EXPoSE) algorithm for large-scale anomaly detection. By using stochastic optimization, an ϵ-accurate model can be estimated in constant time, independent of the dataset size.

Contribution

It reformulates EXPoSE as a stochastic optimization problem, enabling epsilon-accurate models to be estimated in constant time regardless of dataset size.

Findings

01 Achieves epsilon-accurate models in constant time

02 Applicable to infinite-dimensional Hilbert spaces

03 No additional step-size parameters needed

Abstract

A new algorithm named EXPected Similarity Estimation (EXPoSE) was recently proposed to solve the problem of large-scale anomaly detection. It is a non-parametric and distribution-free kernel method based on the Hilbert space embedding of probability measures. Given a dataset of $n$ samples, EXPoSE needs only $O(n)$ (linear time) to build a model and $O(1)$ (constant time) to make a prediction. In this work we improve the linear computational complexity and show that an $ϵ$-accurate model can be estimated in constant time, which has significant implications for large-scale learning problems. To achieve this goal, we cast the original EXPoSE formulation into a stochastic optimization problem. It is crucial that this approach allows us to determine the number of iterations based on a desired accuracy $ϵ$, independent of the dataset size $n$. We will show…
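The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions that do not come from the abstract: the kernel is an RBF kernel approximated with random Fourier features, the model is the mean of the feature vectors, and the stochastic update uses the step size 1/t (which turns stochastic gradient descent on the squared distance to a sampled feature vector into a running mean). The names `make_feature_map`, `fit_full`, `fit_stochastic`, and `score` are hypothetical. The sketch contrasts the O(n) model with one built from a fixed number of iterations chosen independently of n.

```python
import numpy as np


def make_feature_map(dim, n_features, gamma, rng):
    """Random Fourier features approximating exp(-gamma * ||x - y||^2).

    Assumption for illustration only: the abstract does not fix a kernel
    or a feature approximation.
    """
    W = rng.normal(scale=np.sqrt(2.0 * gamma), size=(dim, n_features))
    b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)

    def phi(X):
        X = np.atleast_2d(X)
        return np.sqrt(2.0 / n_features) * np.cos(X @ W + b)

    return phi


def fit_full(X, phi):
    """O(n) model: the empirical mean of all feature vectors."""
    return phi(X).mean(axis=0)


def fit_stochastic(X, phi, n_iter, rng):
    """Model from n_iter stochastic steps; the cost does not depend on len(X).

    Each step samples one point and takes a gradient step on
    0.5 * ||w - phi(x)||^2 with step size 1/t (an assumed schedule),
    so w is the running mean of the sampled feature vectors.
    """
    w = np.zeros(phi(X[:1]).shape[1])
    for t in range(1, n_iter + 1):
        x = X[rng.integers(len(X))]
        w -= (1.0 / t) * (w - phi(x)[0])
    return w


def score(z, w, phi):
    """Constant-time prediction: inner product of phi(z) with the model.

    Low scores indicate points that are dissimilar to the training data.
    """
    return phi(z) @ w
```

In this sketch `n_iter` would be picked from the desired accuracy alone, so growing the dataset leaves the training cost unchanged. How the required number of iterations depends on ϵ is stated in the paper, not here.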

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Sparse and Compressive Sensing Techniques