Data-adaptive trimming of the Hill estimator and detection of outliers   in the extremes of heavy-tailed data

Shrijita Bhattacharya; Michael Kallitsis; Stilian Stoev

arXiv:1808.07704·stat.ME·August 24, 2018

Data-adaptive trimming of the Hill estimator and detection of outliers in the extremes of heavy-tailed data

Shrijita Bhattacharya, Michael Kallitsis, Stilian Stoev

PDF

TL;DR

This paper proposes a robust, data-adaptive trimming method for the Hill estimator to accurately estimate the tail index of heavy-tailed distributions and detect outliers in extreme data, improving robustness and efficiency.

Contribution

It introduces a novel, adaptive trimming approach for the Hill estimator that is robust to outliers and contamination, with proven asymptotic properties and practical outlier detection capabilities.

Findings

01

Estimator is asymptotically normal under second order regular variation.

02

Method is minimax rate-optimal in the Hall class of distributions.

03

Successfully identifies outliers in real heavy-tailed data sets.

Abstract

We introduce a trimmed version of the Hill estimator for the index of a heavy-tailed distribution, which is robust to perturbations in the extreme order statistics. In the ideal Pareto setting, the estimator is essentially finite-sample efficient among all unbiased estimators with a given strict upper break-down point. For general heavy-tailed models, we establish the asymptotic normality of the estimator under second order regular variation conditions and also show it is minimax rate-optimal in the Hall class of distributions. We also develop an automatic, data-driven method for the choice of the trimming parameter which yields a new type of robust estimator that can adapt to the unknown level of contamination in the extremes. This adaptive robustness property makes our estimator particularly appealing and superior to other robust estimators in the setting where the extremes of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.