Loading paper
Ps and Qs: Quantization-aware pruning for efficient low latency neural network inference | Tomesphere