Combinatorial optimization of the coefficient of determination
Marc Harary

TL;DR
This paper introduces a novel combinatorial geometry algorithm called quadratic sweep for selecting the optimal subset of points with the highest coefficient of determination, demonstrating high accuracy and efficiency through extensive experiments.
Contribution
The paper presents the quadratic sweep algorithm, a new method leveraging geometric projections and topological sweep to optimize the coefficient of determination in subset selection.
Findings
Achieves optimal subset selection in extensive experiments up to n=30.
Demonstrates the method's efficiency with a topological sweep in (n^5 log n) time.
Provides open-source implementation and reproducible experiments.
Abstract
Robust correlation analysis is among the most critical challenges in statistics. Herein, we develop an efficient algorithm for selecting the - subset of points in the plane with the highest coefficient of determination . Drawing from combinatorial geometry, we propose a method called the \textit{quadratic sweep} that consists of two steps: (i) projectively lifting the data points into and then (ii) iterating over each linearly separable -subset. Its basis is that the optimal set of outliers is separable from its complement in by a conic section, which, in , can be found by a topological sweep in time. Although key proofs of quadratic separability remain underway, we develop strong mathematical intuitions for our conjectures, then experimentally demonstrate our method's optimality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Scientific Research Methods · Statistical and Computational Modeling · Water Quality Monitoring and Analysis
MethodsSparse Evolutionary Training
