Loading paper
LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models | Tomesphere