Approximate Maximum Halfspace Discrepancy
Michael Matheny, Jeff M. Phillips

TL;DR
This paper presents an efficient algorithm for approximating the maximum discrepancy between red and blue points in a geometric range space defined by halfspaces, with applications in machine learning and spatial statistics.
Contribution
It introduces a nearly optimal algorithm for approximate maximum halfspace discrepancy with improved runtime and a new data structure for halfspace range counting.
Findings
Achieves $O(|X| + (1/^d) \u221a) time complexity
Provides nearly tight lower bounds for $d=2$ case
Develops a new -approximate halfspace range counting data structure
Abstract
Consider the geometric range space where and is the set of ranges defined by -dimensional halfspaces. In this setting we consider that is the disjoint union of a red and blue set. For each halfspace define a function that measures the "difference" between the fraction of red and fraction of blue points which fall in the range . In this context the maximum discrepancy problem is to find the . We aim to instead find an such that . This is the central problem in linear classification for machine learning, in spatial scan statistics for spatial anomaly detection, and shows up in many other areas. We provide a solution for this problem in $O(|X| + (1/\varepsilon^d) \log^4…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
