Understanding and inverse design of implicit bias in stochastic learning: a geometric perspective

Nicola Aladrah; Emanuele Ballarin; Matteo Biagetti; Alessio Ansuini; Alberto d'Onofrio; Fabio Anselmi

arXiv:2601.06597·cs.LG·April 7, 2026

Understanding and inverse design of implicit bias in stochastic learning: a geometric perspective

Nicola Aladrah, Emanuele Ballarin, Matteo Biagetti, Alessio Ansuini, Alberto d'Onofrio, Fabio Anselmi

PDF

TL;DR

This paper presents a geometric framework to understand and control implicit bias in stochastic learning, linking it to the interplay of gradient noise and symmetries, enabling inverse design of model biases.

Contribution

It introduces a unifying geometric theory of implicit bias, predicts new behaviors, and demonstrates how to engineer biases through parameterization design.

Findings

01

The framework predicts implicit bias across various architectures.

02

Numerical experiments validate the theoretical predictions.

03

Inverse design can shape biases like sparsity in models.

Abstract

A key challenge in machine learning is to explain how learning dynamics select among the many solutions that achieve identical loss values in overparameterized models - a phenomenon known as implicit bias. Controlling this bias provides a direct mechanism on learned representations, which are central to interpretability, robustness, and reasoning in modern AI systems. Yet, despite its importance, existing explanations remain largely ad hoc and lack a unifying mechanism. We develop a theoretical and constructive framework in which implicit bias emerges as a geometric correction induced by the interplay between gradient noise and continuous symmetries of the loss. We compute the induced bias across a range of architectures, predicting new behaviors and explaining known ones. The approach also enables inverse design: by engineering predictor - preserving parameterizations, it is possible…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.