HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual Grounding