Loading paper
Mixed Precision Low-bit Quantization of Neural Network Language Models for Speech Recognition | Tomesphere