Loading paper
Gather and Trace: Rethinking Video TextVQA from an Instance-oriented Perspective | Tomesphere