VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors