Loading paper
Beyond Test-Time Compute Strategies: Advocating Energy-per-Token in LLM Inference | Tomesphere