Algorithms and Fundamental Limits for Unlabeled Detection using Types
Stefano Marano, Peter Willett

TL;DR
This paper investigates the fundamental limits of binary hypothesis testing with unlabeled data, establishing error exponents, and proposes practical low-complexity algorithms for detection without label estimation.
Contribution
It introduces the first theoretical analysis of detection limits with unlabeled data and develops new algorithms with provable efficiency for practical implementation.
Findings
Established fundamental error exponent limits for unlabeled detection.
Proposed low-complexity algorithms with ${\cal O}(n^2)$ worst-case complexity.
Validated algorithms through computer experiments.
Abstract
Emerging applications of sensor networks for detection sometimes suggest that classical problems ought be revisited under new assumptions. This is the case of binary hypothesis testing with independent - but not necessarily identically distributed - observations under the two hypotheses, a formalism so orthodox that it is used as an opening example in many detection classes. However, let us insert a new element, and address an issue perhaps with impact on strategies to deal with "big data" applications: What would happen if the structure were streamlined such that data flowed freely throughout the system without provenance? How much information (for detection) is contained in the sample values, and how much in their labels? How should decision-making proceed in this case? The theoretical contribution of this work is to answer these questions by establishing the fundamental limits, in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
