Approximation of the Proximal Operator of the $\ell_\infty$ Norm Using a   Neural Network

Kathryn Linehan; Radu Balan

arXiv:2408.11211·math.NA·August 22, 2024

Approximation of the Proximal Operator of the $\ell_\infty$ Norm Using a Neural Network

Kathryn Linehan, Radu Balan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural network-based method to approximate the proximal operator of the infinity norm efficiently, avoiding sorting operations, and demonstrates its accuracy and computational benefits over traditional methods.

Contribution

The authors develop an $O(m)$ neural network approximation for the proximal operator of the infinity norm that handles variable input sizes using feature selection.

Findings

01

The neural network outperforms vanilla neural networks in approximation accuracy.

02

The proposed method is computationally more efficient than exact algorithms.

03

Feature importance analysis shows effective selection of input data features.

Abstract

Computing the proximal operator of the $ℓ_{\infty}$ norm, $prox_{α ∣∣ \cdot ∣ ∣_{\infty}} (x)$ , generally requires a sort of the input data, or at least a partial sort similar to quicksort. In order to avoid using a sort, we present an $O (m)$ approximation of $prox_{α ∣∣ \cdot ∣ ∣_{\infty}} (x)$ using a neural network. A novel aspect of the network is that it is able to accept vectors of varying lengths due to a feature selection process that uses moments of the input data. We present results on the accuracy of the approximation, feature importance, and computational efficiency of the approach. We show that the network outperforms a "vanilla neural network" that does not use feature selection. We also present an algorithm with corresponding theory to calculate $prox_{α ∣∣ \cdot ∣ ∣_{\infty}} (x)$ exactly, relate it to the Moreau…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

klinehan1/prox_op_nn
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIterative Methods for Nonlinear Equations

MethodsFeature Selection