On a Notion of Outliers Based on Ratios of Order Statistics
Ahmet Zahid Balc{\i}o\u{g}lu, O\u{g}uz G\"urerk

TL;DR
This paper introduces a new, robust method for identifying outliers based on ratios of order statistics, which effectively distinguishes tail behaviors in distributions without prior assumptions.
Contribution
It proposes a novel outlier detection statistic using ratios of partial sums of order statistics, with an automatic cutoff algorithm for practical application.
Findings
Successfully distinguishes Pareto tails outside Lévy stable region
Automatically selects outlier cutoff points
Effective in simulation studies
Abstract
There are a number of mathematical formalisms of the term "outlier" in statistics, though there is no consensus on what the right notion ought to be. Accordingly, we try to give a consistent and robust definition for a specific type of outliers defined via order statistics. Our approach is based on ratios of partial sums of order statistics to investigate the tail behaviors of hypothetical and empirical distributions. We simulate our statistic on a set of distributions to mark potential outliers and use an algorithm to automatically select a cut-off point without the need of any further a priori assumption. Finally, we show the efficacy of our statistic by a simulation study on distinguishing two Pareto tails outside of the L\'{e}vy stable region.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Advanced Statistical Methods and Models · Statistical Distribution Estimation and Applications
