Loading paper
RDKV: Rate-Distortion Bit Allocation for Joint Eviction and Quantization of the KV Cache | Tomesphere