Loading paper
Understanding Efficiency: Quantization, Batching, and Serving Strategies in LLM Energy Use | Tomesphere