Multimodal Inverse Cloze Task for Knowledge-based Visual Question Answering