Statistical Inference Using Mean Shift Denoising

Yunhua Xiang; Yen-Chi Chen

arXiv:1610.03927·stat.ME·October 14, 2016·2 cites

Statistical Inference Using Mean Shift Denoising

Yunhua Xiang, Yen-Chi Chen

PDF

Open Access

TL;DR

This paper explores the use of the mean shift algorithm as a denoising tool, analyzing its effects on data distribution and demonstrating improvements in clustering, hypothesis testing, and anomaly detection.

Contribution

It introduces a novel framework viewing mean shift as a distribution operator, providing theoretical insights and practical benefits for data analysis tasks.

Findings

01

Mean shift concentrates data around high-density regions.

02

Denoising with mean shift improves clustering performance.

03

Enhances power of two-sample tests and aids anomaly detection.

Abstract

In this paper, we study how the mean shift algorithm can be used to denoise a dataset. We introduce a new framework to analyze the mean shift algorithm as a denoising approach by viewing the algorithm as an operator on a distribution function. We investigate how the mean shift algorithm changes the distribution and show that data points shifted by the mean shift concentrate around high density regions of the underlying density function. By using the mean shift as a denoising method, we enhance the performance of several clustering techniques, improve the power of two-sample tests, and obtain a new method for anomaly detection.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Time Series Analysis and Forecasting · Image and Signal Denoising Methods