Loading paper
GIM: Evaluating models via tasks that integrate multiple cognitive domains | Tomesphere