Loading paper
TransVG: End-to-End Visual Grounding with Transformers | Tomesphere