Loading paper
When Quantization Is Free: An int4 KV Cache That Outruns fp16 on Apple Silicon | Tomesphere