Robust Low-Complexity Randomized Methods for Locating Outliers in Large   Matrices

Xingguo Li; Jarvis Haupt

arXiv:1612.02334·cs.IT·December 12, 2016·1 cites

Robust Low-Complexity Randomized Methods for Locating Outliers in Large Matrices

Xingguo Li, Jarvis Haupt

PDF

Open Access

TL;DR

This paper introduces a randomized two-step framework for accurately identifying outlier columns in large, noisy, or incomplete matrices, with proven theoretical guarantees and demonstrated computational efficiency.

Contribution

It proposes a novel randomized inference method with theoretical sample complexity bounds for outlier detection in large matrices.

Findings

01

The method accurately locates outliers with high probability.

02

The approach is computationally efficient for large-scale data.

03

Theoretical bounds are validated through numerical experiments.

Abstract

This paper examines the problem of locating outlier columns in a large, otherwise low-rank matrix, in settings where {}{the data} are noisy, or where the overall matrix has missing elements. We propose a randomized two-step inference framework, and establish sufficient conditions on the required sample complexities under which these methods succeed (with high probability) in accurately locating the outliers for each task. Comprehensive numerical experimental results are provided to verify the theoretical bounds and demonstrate the computational efficiency of the proposed algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Machine Learning and Algorithms · Face and Expression Recognition