Zero-Shot Transfer VQA Dataset
Yuanpeng Li, Yi Yang, Jianyu Wang, Wei Xu

TL;DR
This paper introduces ZST-VQA, a new dataset for zero-shot transfer in visual question answering, highlighting the challenge of transferring knowledge between question understanding and answering, and evaluating current models' limitations.
Contribution
The paper presents the ZST-VQA dataset, reorganized from VQA v1.0, to specifically evaluate zero-shot transfer capabilities in VQA models, a novel benchmark for this problem.
Findings
Existing models perform poorly on zero-shot transfer tasks.
Performance drops indicate current methods do not effectively handle zero-shot transfer.
Implicit bias during training may hinder transfer learning in VQA models.
Abstract
Acquiring a large vocabulary is an important aspect of human intelligence. Onecommon approach for human to populating vocabulary is to learn words duringreading or listening, and then use them in writing or speaking. This ability totransfer from input to output is natural for human, but it is difficult for machines.Human spontaneously performs this knowledge transfer in complicated multimodaltasks, such as Visual Question Answering (VQA). In order to approach human-levelArtificial Intelligence, we hope to equip machines with such ability. Therefore, toaccelerate this research, we propose a newzero-shot transfer VQA(ZST-VQA)dataset by reorganizing the existing VQA v1.0 dataset in the way that duringtraining, some words appear only in one module (i.e. questions) but not in theother (i.e. answers). In this setting, an intelligent model should understand andlearn the concepts from one…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Imaging Techniques and Applications · Medical Imaging and Analysis
