Confidence-aware Non-repetitive Multimodal Transformers for TextCaps