Loading paper
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs | Tomesphere