Loading paper
IWISDM: Assessing instruction following in multimodal models at scale | Tomesphere