SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving