Universal Outlier Hypothesis Testing via Mean- and Median-Based Tests
Bernhard C. Geiger, Tobias Koch, Josipa Mihaljevi\'c, Maximilian Toller

TL;DR
This paper introduces mean- and median-based tests for universal outlier hypothesis testing, achieving optimal error exponents under different growth conditions of outlier sequences, without prior knowledge of distributions.
Contribution
It proposes novel mean- and median-based tests that match the performance of maximum likelihood tests in outlier detection with unknown distributions, under different outlier growth regimes.
Findings
Mean-based test performs well when outliers are sublinear in total sequences.
Median-based test achieves optimal error exponent when outliers are proportional to total sequences.
Introduces the concept of typical error exponent for this setting.
Abstract
Universal outlier hypothesis testing refers to a hypothesis testing problem where one observes a large number of length- sequences -- the majority of which are distributed according to the typical distribution and a small number are distributed according to the outlier distribution -- and one wishes to decide, which of these sequences are outliers without having knowledge of and . In contrast to previous works, in this paper it is assumed that both the number of observation sequences and the number of outlier sequences grow with the sequence length. In this case, the typical distribution can be estimated by computing the mean over all observation sequences, provided that the number of outlier sequences is sublinear in the total number of sequences. It is demonstrated that, in this case, one can achieve the error exponent of the maximum likelihood test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSARS-CoV-2 detection and testing · Wireless Communication Security Techniques · Distributed Sensor Networks and Detection Algorithms
