SOCP relaxation bounds for the optimal subset selection problem applied to robust linear regression
Salvador Flores

TL;DR
This paper introduces a novel semidefinite programming-based bounding technique within a branch and bound framework to efficiently find globally optimal subsets for robust linear regression, significantly reducing computation time.
Contribution
The paper presents a new pruning strategy using pseudo-convexifications and semidefinite bounds to improve global optimization of subset selection in robust regression.
Findings
Reduces computation time for LTSE problem
Maintains high accuracy with negligible loss
Performs well on difficult instances
Abstract
This paper deals with the problem of finding the globally optimal subset of h elements from a larger set of n elements in d space dimensions so as to minimize a quadratic criterion, with an special emphasis on applications to computing the Least Trimmed Squares Estimator (LTSE) for robust regression. The computation of the LTSE is a challenging subset selection problem involving a nonlinear program with continuous and binary variables, linked in a highly nonlinear fashion. The selection of a globally optimal subset using the branch and bound (BB) algorithm is limited to problems in very low dimension, tipically d<5, as the complexity of the problem increases exponentially with d. We introduce a bold pruning strategy in the BB algorithm that results in a significant reduction in computing time, at the price of a negligeable accuracy lost. The novelty of our algorithm is that the bounds…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
