MISR: Measuring Instrumental Self-Reasoning in Frontier Models