Loading paper
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling | Tomesphere