Understanding and inverse design of implicit bias in stochastic learning: a geometric perspective
Nicola Aladrah, Emanuele Ballarin, Matteo Biagetti, Alessio Ansuini, Alberto d'Onofrio, Fabio Anselmi

TL;DR
This paper presents a geometric framework to understand and control implicit bias in stochastic learning, linking it to the interplay of gradient noise and symmetries, enabling inverse design of model biases.
Contribution
It introduces a unifying geometric theory of implicit bias, predicts new behaviors, and demonstrates how to engineer biases through parameterization design.
Findings
The framework predicts implicit bias across various architectures.
Numerical experiments validate the theoretical predictions.
Inverse design can shape biases like sparsity in models.
Abstract
A key challenge in machine learning is to explain how learning dynamics select among the many solutions that achieve identical loss values in overparameterized models - a phenomenon known as implicit bias. Controlling this bias provides a direct mechanism on learned representations, which are central to interpretability, robustness, and reasoning in modern AI systems. Yet, despite its importance, existing explanations remain largely ad hoc and lack a unifying mechanism. We develop a theoretical and constructive framework in which implicit bias emerges as a geometric correction induced by the interplay between gradient noise and continuous symmetries of the loss. We compute the induced bias across a range of architectures, predicting new behaviors and explaining known ones. The approach also enables inverse design: by engineering predictor - preserving parameterizations, it is possible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
