Sherry: Hardware-Efficient 1.25-Bit Ternary Quantization via Fine-grained Sparsification