Loading paper
Rate-Distortion Optimization for Transformer Inference | Tomesphere