Star Discrepancy Subset Selection: Problem Formulation and Efficient Approaches for Low Dimensions
Fran\c{c}ois Cl\`ement, Carola Doerr, Lu\'is Paquete

TL;DR
This paper formulates the star discrepancy subset selection problem, proves its NP-hardness, and proposes MILP and branch-and-bound methods, showing their efficiency in low dimensions and potential for generating low-discrepancy point sets.
Contribution
It introduces the star discrepancy subset selection problem, provides formulations and algorithms, and demonstrates their effectiveness and limitations in low-dimensional settings.
Findings
MILP and BB are efficient in 2D for certain set sizes
Selected subsets have lower discrepancy than common sequences
Performance declines in higher dimensions and larger sets
Abstract
Motivated by applications in instance selection, we introduce the star discrepancy subset selection problem, which consists of finding a subset of m out of n points that minimizes the star discrepancy. First, we show that this problem is NP-hard. Then, we introduce a mixed integer linear formulation (MILP) and a combinatorial branch-and-bound (BB) algorithm for the star discrepancy subset selection problem and we evaluate both approaches against random subset selection and a greedy construction on different use-cases in dimension two and three. Our results show that the MILP and BB are efficient in dimension two for large and small ratio, respectively, and for not too large n. However, the performance of both approaches decays strongly for larger dimensions and set sizes. As a side effect of our empirical comparisons we obtain point sets of discrepancy values that are much…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Surgical Treatments · Cultural Heritage Materials Analysis · Image and Object Detection Techniques
