Less Memory Means smaller GPUs: Backpropagation with Compressed Activations