Loading paper
Combining learning rate decay and weight decay with complexity gradient descent - Part I | Tomesphere