Data-Dependent Complexity of First-Order Methods for Binary Classification

Matthew Hough; Stephen A. Vavasis

arXiv:2512.03947·math.OC·December 4, 2025

Data-Dependent Complexity of First-Order Methods for Binary Classification

Matthew Hough, Stephen A. Vavasis

PDF

Open Access

TL;DR

This paper analyzes the data-dependent convergence behavior of first-order methods like FISTA for binary classification problems, providing bounds and early stopping criteria that improve efficiency in large-scale data science tasks.

Contribution

It introduces data-dependent iteration bounds and early stopping criteria for FISTA applied to binary classification, enhancing convergence analysis and practical efficiency.

Findings

01

FISTA residual converges to the infimal displacement vector in ESP

02

Data-dependent iteration bounds scale as 1/δ_A^2

03

Early stopping yields well-classified hyperplanes with fewer iterations

Abstract

Large-scale problems in data science are often modeled with optimization, and the optimization model is usually solved with first-order methods that may converge at a sublinear rate. Therefore, it is of interest to terminate the optimization algorithm as soon as the underlying data science task is accomplished. We consider FISTA for solving two binary classification problems: the ellipsoid separation problem (ESP), and the soft-margin support-vector machine (SVM). For the ESP, we cast the dual second-order cone program into a form amenable to FISTA and show that the FISTA residual converges to the infimal displacement vector of the primal-dual hybrid gradient (PDHG) algorithm, that directly encodes a separating hyperplane. We further derive a data-dependent iteration upper bound scaling as $O (1/ δ_{A}^{2})$ , where $δ_{A}$ is the minimal perturbation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research