Loading paper
What do Models Learn From Training on More Than Text? Measuring Visual Commonsense Knowledge | Tomesphere