Unifying Boxplots: A Multiple Testing Perspective
Bowen Gang, Hongmei Lin, Tiejun Tong

TL;DR
This paper presents a unified, multiple testing-based framework for boxplots, connecting classic and modern outlier detection methods, and introduces a new FDR-controlled boxplot variant for more adaptive outlier identification.
Contribution
It recasts boxplots as multiple testing procedures, systematizes existing methods, and introduces a novel FDR-based boxplot for improved outlier detection.
Findings
Classic boxplot corresponds to an unadjusted multiple testing procedure.
Sample-size-aware modifications control FWER or PFER.
New FDR-based boxplot offers a more adaptive outlier detection method.
Abstract
Tukey's boxplot is a foundational tool for exploratory data analysis, but its classic outlier-flagging rule does not account for the sample size, and subsequent modifications have often been presented as separate, heuristic adjustments. In this paper, we propose a unifying framework that recasts the boxplot and its variants as graphical implementations of multiple testing procedures. We demonstrate that Tukey's original method is equivalent to an unadjusted procedure, while existing sample-size-aware modifications correspond to controlling the Family-Wise Error Rate (FWER) or the Per-Family Error Rate (PFER). This perspective not only systematizes existing methods but also naturally leads to new, more adaptive constructions. We introduce a boxplot motivated by the False Discovery Rate (FDR), and show how our framework provides a flexible pipeline for integrating state-of-the-art robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Statistical Methods and Models · Data Analysis with R
