Data-Dependent Complexity of First-Order Methods for Binary Classification
Matthew Hough, Stephen A. Vavasis

TL;DR
This paper analyzes the data-dependent convergence behavior of first-order methods like FISTA for binary classification problems, providing bounds and early stopping criteria that improve efficiency in large-scale data science tasks.
Contribution
It introduces data-dependent iteration bounds and early stopping criteria for FISTA applied to binary classification, enhancing convergence analysis and practical efficiency.
Findings
FISTA residual converges to the infimal displacement vector in ESP
Data-dependent iteration bounds scale as 1/δ_A^2
Early stopping yields well-classified hyperplanes with fewer iterations
Abstract
Large-scale problems in data science are often modeled with optimization, and the optimization model is usually solved with first-order methods that may converge at a sublinear rate. Therefore, it is of interest to terminate the optimization algorithm as soon as the underlying data science task is accomplished. We consider FISTA for solving two binary classification problems: the ellipsoid separation problem (ESP), and the soft-margin support-vector machine (SVM). For the ESP, we cast the dual second-order cone program into a form amenable to FISTA and show that the FISTA residual converges to the infimal displacement vector of the primal-dual hybrid gradient (PDHG) algorithm, that directly encodes a separating hyperplane. We further derive a data-dependent iteration upper bound scaling as , where is the minimal perturbation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research
