WHODUNIT: Evaluation benchmark for culprit detection in mystery stories