Training-Time Batch Normalization Reshapes Local Partition Geometry in Piecewise-Affine Networks
Xuan Qi, Yi Wei, Fanqi Yu, Furao Shen, Vittorio Murino, Cigdem Beyan

TL;DR
This paper investigates how training-time batch normalization influences the geometric partitioning of the input space in piecewise-affine neural networks, revealing its role as a batch-conditional recentering mechanism.
Contribution
It provides a geometric analysis of batch normalization's effect on local partition refinement in ReLU networks during training.
Findings
BN defines reference hyperplanes through batch centroids.
BN increases expected local partition refinement under certain conditions.
The mechanism transfers locally through depth where the representation is affine.
Abstract
Batch normalization (BN) is central to modern deep networks, but its effect on the realized function during training remains less understood than its optimization benefits. We study training-time BN in continuous piecewise-affine (CPA) networks through the geometry of switching hyperplanes and the induced affine-region partition. Conditioned on a mini-batch, we show that BN defines for each neuron a reference hyperplane through the batch centroid, and that breakpoint-switching hyperplanes are parallel translates whose offsets are expressed in batch-standardized coordinates and are independent of the raw bias. This yields an exact criterion for when a switching hyperplane intersects a local window and motivates a local region-density functional based on exact affine-region counts. Under explicit sufficient conditions, we show that BN increases expected local partition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
