Max-affine regression via first-order methods

Seonho Kim; Kiryung Lee

arXiv:2308.08070·stat.ML·August 17, 2023

Max-affine regression via first-order methods

Seonho Kim, Kiryung Lee

PDF

Open Access

TL;DR

This paper analyzes the convergence of gradient-based methods for max-affine regression, demonstrating linear convergence under certain conditions and showing SGD's efficiency in noisy and low-sample scenarios.

Contribution

It provides the first non-asymptotic convergence analysis of GD and SGD for max-affine regression with theoretical guarantees.

Findings

01

GD and SGD converge linearly to a neighborhood of the true model

02

SGD outperforms alternating minimization and GD in noisy, low-sample settings

03

Numerical results support the theoretical convergence guarantees

Abstract

We consider regression of a max-affine model that produces a piecewise linear model by combining affine models via the max function. The max-affine model ubiquitously arises in applications in signal processing and statistics including multiclass classification, auction problems, and convex regression. It also generalizes phase retrieval and learning rectifier linear unit activation functions. We present a non-asymptotic convergence analysis of gradient descent (GD) and mini-batch stochastic gradient descent (SGD) for max-affine regression when the model is observed at random locations following the sub-Gaussianity and an anti-concentration with additive sub-Gaussian noise. Under these assumptions, a suitably initialized GD and SGD converge linearly to a neighborhood of the ground truth specified by the corresponding error bound. We provide numerical results that corroborate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Stochastic Gradient Optimization Techniques · Advanced X-ray and CT Imaging

MethodsStochastic Gradient Descent