Unifying Boxplots: A Multiple Testing Perspective

Bowen Gang; Hongmei Lin; Tiejun Tong

arXiv:2510.20259·stat.ME·October 24, 2025

Unifying Boxplots: A Multiple Testing Perspective

Bowen Gang, Hongmei Lin, Tiejun Tong

PDF

Open Access

TL;DR

This paper presents a unified, multiple testing-based framework for boxplots, connecting classic and modern outlier detection methods, and introduces a new FDR-controlled boxplot variant for more adaptive outlier identification.

Contribution

It recasts boxplots as multiple testing procedures, systematizes existing methods, and introduces a novel FDR-based boxplot for improved outlier detection.

Findings

01

Classic boxplot corresponds to an unadjusted multiple testing procedure.

02

Sample-size-aware modifications control FWER or PFER.

03

New FDR-based boxplot offers a more adaptive outlier detection method.

Abstract

Tukey's boxplot is a foundational tool for exploratory data analysis, but its classic outlier-flagging rule does not account for the sample size, and subsequent modifications have often been presented as separate, heuristic adjustments. In this paper, we propose a unifying framework that recasts the boxplot and its variants as graphical implementations of multiple testing procedures. We demonstrate that Tukey's original method is equivalent to an unadjusted procedure, while existing sample-size-aware modifications correspond to controlling the Family-Wise Error Rate (FWER) or the Per-Family Error Rate (PFER). This perspective not only systematizes existing methods but also naturally leads to new, more adaptive constructions. We introduce a boxplot motivated by the False Discovery Rate (FDR), and show how our framework provides a flexible pipeline for integrating state-of-the-art robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Advanced Statistical Methods and Models · Data Analysis with R