Automated Selection of r for the r Largest Order Statistics Approach with Adjustment for Sequential Testing
Brian Bader, Jun Yan, Xuebin Zhang

TL;DR
This paper introduces new sequential testing methods, including bootstrap and entropy-based tests, for selecting the optimal number of order statistics in extreme value analysis, improving accuracy and computational efficiency.
Contribution
It develops novel sequential tests for choosing r in the r largest order statistics approach, with methods that control error rates and enhance computational speed.
Findings
Both tests effectively detect model misspecification in simulations.
The procedures control false discovery and familywise error rates.
Applications demonstrate improved extreme event analysis.
Abstract
The r largest order statistics approach is widely used in extreme value analysis because it may use more information from the data than just the block maxima. In practice, the choice of r is critical. If r is too large, bias can occur; if too small, the variance of the estimator can be high. The limiting distribution of the r largest order statistics, denoted by GEVr, extends that of the block maxima. Two specification tests are proposed to select r sequentially. The first is a score test for the GEVr distribution. Due to the special characteristics of the GEVr distribution, the classical chi-square asymptotics cannot be used. The simplest approach is to use the parametric bootstrap, which is straightforward to implement but computationally expensive. An alternative fast weighted bootstrap or multiplier procedure is developed for computational efficiency. The second test uses the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
