Least Squares Revisited: Scalable Approaches for Multi-class Prediction

Alekh Agarwal; Sham M. Kakade; Nikos Karampatziakis; Le Song; Gregory; Valiant

arXiv:1310.1949·cs.LG·October 22, 2013·21 cites

Least Squares Revisited: Scalable Approaches for Multi-class Prediction

Alekh Agarwal, Sham M. Kakade, Nikos Karampatziakis, Le Song, Gregory, Valiant

PDF

Open Access

TL;DR

This paper introduces simple, scalable, and parameter-free iterative least-squares algorithms for multi-class and multi-label prediction that outperform first-order methods both theoretically and empirically.

Contribution

It presents new iterative least-squares algorithms with convergence guarantees and demonstrates their practical efficiency and accuracy on large-scale datasets.

Findings

01

Algorithms outperform first-order methods in practice

02

Achieve significant speedups over Liblinear and Vowpal Wabbit

03

Attain state-of-the-art accuracy on MNIST and CIFAR-10

Abstract

This work provides simple algorithms for multi-class (and multi-label) prediction in settings where both the number of examples n and the data dimension d are relatively large. These robust and parameter free algorithms are essentially iterative least-squares updates and very versatile both in theory and in practice. On the theoretical front, we present several variants with convergence guarantees. Owing to their effective use of second-order structure, these algorithms are substantially better than first-order methods in many practical scenarios. On the empirical side, we present a scalable stagewise variant of our approach, which achieves dramatic computational speedups over popular optimization packages such as Liblinear and Vowpal Wabbit on standard datasets (MNIST and CIFAR-10), while attaining state-of-the-art accuracies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning