Interpretable global minima of deep ReLU neural networks on sequentially separable data

Thomas Chen; Patr\'icia Mu\~noz Ewald

arXiv:2405.07098·cs.LG·January 14, 2026

Interpretable global minima of deep ReLU neural networks on sequentially separable data

Thomas Chen, Patr\'icia Mu\~noz Ewald

PDF

Open Access 1 Video

TL;DR

This paper constructs explicit zero-loss neural network classifiers for sequentially separable data, revealing the structure of global minima in terms of cumulative parameters and recursive truncation maps.

Contribution

It provides a novel explicit construction of global minima for deep ReLU networks on specific separable data, with a parameterization that clarifies the network's structure.

Findings

01

Global minimizers can be explicitly described with Q(M+2) parameters.

02

Configurations include well-separated clusters and sequential linear separability.

03

The approach offers insights into the structure of optimal neural network classifiers.

Abstract

We explicitly construct zero loss neural network classifiers. We write the weight matrices and bias vectors in terms of cumulative parameters, which determine truncation maps acting recursively on input space. The configurations for the training data considered are (i) sufficiently small, well separated clusters corresponding to each class, and (ii) equivalence classes which are sequentially linearly separable. In the best case, for $Q$ classes of data in $R^{M}$ , global minimizers can be described with $Q (M + 2)$ parameters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Interpretable Global Minima of Deep ReLU Neural Networks on Sequentially Separable Data· slideslive

Taxonomy

TopicsNeural Networks and Applications