Loading paper
ExpVG: Investigating the Design Space of Visual Grounding in Multimodal Large Language Model | Tomesphere