Loading paper
Permute, Quantize, and Fine-tune: Efficient Compression of Neural Networks | Tomesphere