Loading paper
JustLogic: A Comprehensive Benchmark for Evaluating Deductive Reasoning in Large Language Models | Tomesphere