Loading paper
MaxVA: Fast Adaptation of Step Sizes by Maximizing Observed Variance of Gradients | Tomesphere