Implicit bias of SGD in $L_{2}$-regularized linear DNNs: One-way jumps from high to low rank