Loading paper
Evaluating Large Language Models with NeuBAROCO: Syllogistic Reasoning Ability and Human-like Biases | Tomesphere