Loading paper
A Multi-Agent Pokemon Tournament for Evaluating Strategic Reasoning of Large Language Models | Tomesphere