Region Seeding via Pre-Activation Regularization: A Geometric View of Piecewise Affine Neural Networks

Yi Wei; Xuan Qi; Furao Shen

arXiv:2605.06300·cs.LG·May 12, 2026

Region Seeding via Pre-Activation Regularization: A Geometric View of Piecewise Affine Neural Networks

Yi Wei, Xuan Qi, Furao Shen

PDF

TL;DR

This paper introduces a regularizer that encourages neural networks to partition the input space early during training, enhancing their expressive capacity and improving performance on various datasets.

Contribution

It provides a geometric theory linking neuron switching surfaces to local partitioning, and proposes a practical regularizer for seeding data-relevant regions early in training.

Findings

01

Regularizer increases the number of realized affine regions.

02

Improves early-stage accuracy on toy datasets.

03

Achieves comparable or slightly better final accuracy on ImageNet-1k.

Abstract

Deep networks with continuous piecewise affine activations induce polyhedral partitions of the input space, making the number of realized affine regions a natural measure of expressive capacity and a key determinant of how well the model can approximate nonlinear target functions. In practice, standard training realizes far fewer region refinements in data-visited neighborhoods than the architecture could in principle support, while existing region-count theory is primarily architectural and offers little guidance on how optimization shapes the realized partition near the data. Our theory provides a sufficient condition under which bringing neuron switching surfaces sufficiently close to data points ensures their intersection with local neighborhoods, which in turn implies a strict increase in the local affine-region count, yielding a principled training-time handle for seeding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.