Loading paper
Training wide residual networks for deployment using a single bit for each weight | Tomesphere