Global Sequential Testing for Multi-Stream Auditing

Beepul Bharti; Ambar Pal; Jeremias Sulam

arXiv:2602.21479·stat.ML·February 26, 2026

Global Sequential Testing for Multi-Stream Auditing

Beepul Bharti, Ambar Pal, Jeremias Sulam

PDF

Open Access

TL;DR

This paper introduces new sequential testing methods for multi-stream auditing that improve detection efficiency under various alternative hypotheses, balancing false alarm control and rapid detection.

Contribution

It develops merging test martingales and a balanced test that optimize expected stopping times for global hypothesis testing across multiple data streams.

Findings

01

Balanced test achieves $O(\frac{1}{k}\ln\frac{1}{\alpha})$ expected stopping time in dense settings.

02

Proposed tests outperform standard Bonferroni-based methods in simulations.

03

Effective on both synthetic and real-world datasets.

Abstract

Across many risk-sensitive areas, it is critical to continuously audit the performance of machine learning systems and detect any unusual behavior quickly. This can be modeled as a sequential hypothesis testing problem with $k$ incoming streams of data and a global null hypothesis that asserts that the system is working as expected across all $k$ streams. The standard global test employs a Bonferroni correction and has an expected stopping time bound of $O (ln \frac{k}{α})$ when $k$ is large and the significance level of the test, $α$ , is small. In this work, we construct new sequential tests by using ideas of merging test martingales with different trade-offs in expected stopping times under different, sparse or dense alternative hypotheses. We further derive a new, balanced test that achieves an improved expected stopping time bound that matches Bonferroni's in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Imbalanced Data Classification Techniques · Machine Learning and Algorithms