On the robustness of multimodal language model towards distractions