Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you!