A simple efficient density estimator that enables fast systematic search

Jonathan R. Wells; Kai Ming Ting

arXiv:1707.00783·cs.LG·September 13, 2017·1 cites

A simple efficient density estimator that enables fast systematic search

Jonathan R. Wells, Kai Ming Ting

PDF

Open Access

TL;DR

This paper presents a new simple and efficient density estimator that significantly speeds up systematic subspace search in outlying aspects mining, enabling analysis of large, high-dimensional datasets.

Contribution

The paper introduces a novel density estimator that improves computational efficiency, allowing existing outlying aspects mining methods to handle larger and higher-dimensional datasets.

Findings

01

The new estimator is faster than kernel density estimators.

02

Replacing the density estimator in existing methods greatly increases speed.

03

The approach enables analysis of datasets with thousands of dimensions.

Abstract

This paper introduces a simple and efficient density estimator that enables fast systematic search. To show its advantage over commonly used kernel density estimator, we apply it to outlying aspects mining. Outlying aspects mining discovers feature subsets (or subspaces) that describe how a query stand out from a given dataset. The task demands a systematic search of subspaces. We identify that existing outlying aspects miners are restricted to datasets with small data size and dimensions because they employ kernel density estimator, which is computationally expensive, for subspace assessments. We show that a recent outlying aspects miner can run orders of magnitude faster by simply replacing its density estimator with the proposed density estimator, enabling it to deal with large datasets with thousands of dimensions that would otherwise be impossible.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInfluenza Virus Research Studies