Loading paper
Characterizing Performance-Energy Trade-offs of Large Language Models in Multi-Request Workflows | Tomesphere