EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages