BAGEL: Benchmarking Animal Knowledge Expertise in Language Models