MARTHE: Scheduling the Learning Rate Via Online Hypergradients