DEVICE: Depth and Visual Concepts Aware Transformer for OCR-based Image Captioning