Solving Empirical Risk Minimization in the Current Matrix Multiplication Time
Yin Tat Lee, Zhao Song, Qiuyi Zhang

TL;DR
This paper introduces a new algorithm for solving a broad class of convex optimization problems related to empirical risk minimization, achieving near-optimal matrix multiplication time with robust deterministic methods and efficient data structures.
Contribution
The paper presents a deterministic interior point method with a novel data structure, extending current matrix multiplication time algorithms to a wider range of convex problems.
Findings
Achieves runtime close to matrix multiplication bounds for empirical risk minimization.
Provides a robust deterministic central path algorithm.
Extends recent linear programming solutions to broader convex problems.
Abstract
Many convex problems in machine learning and computer science share the same form: \begin{align*} \min_{x} \sum_{i} f_i( A_i x + b_i), \end{align*} where are convex functions on with constant , , and . This problem generalizes linear programming and includes many problems in empirical risk minimization. In this paper, we give an algorithm that runs in time \begin{align*} O^* ( ( n^{\omega} + n^{2.5 - \alpha/2} + n^{2+ 1/6} ) \log (n / \delta) ) \end{align*} where is the exponent of matrix multiplication, is the dual exponent of matrix multiplication, and is the relative accuracy. Note that the runtime has only a log dependence on the condition numbers or other data dependent parameters and these are captured in . For the current bound $\omega…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Sparse and Compressive Sensing Techniques
