Bear the Query in Mind: Visual Grounding with Query-conditioned Convolution