PathVQA: 30000+ Questions for Medical Visual Question Answering
Xuehai He, Yichen Zhang, Luntian Mou, Eric Xing, Pengtao Xie

TL;DR
PathVQA is a pioneering dataset containing over 30,000 questions and images for medical visual question answering, created to facilitate AI development in pathology diagnosis.
Contribution
This work introduces the first large-scale pathology VQA dataset, overcoming privacy and expertise barriers through semi-automated data collection from textbooks.
Findings
Collected 32,799 questions from 4,998 images
Developed a semi-automated pipeline for dataset creation
Dataset will be publicly released to support medical AI research
Abstract
Is it possible to develop an "AI Pathologist" to pass the board-certified examination of the American Board of Pathology? To achieve this goal, the first step is to create a visual question answering (VQA) dataset where the AI agent is presented with a pathology image together with a question and is asked to give the correct answer. Our work makes the first attempt to build such a dataset. Different from creating general-domain VQA datasets where the images are widely accessible and there are many crowdsourcing workers available and capable of generating question-answer pairs, developing a medical VQA dataset is much more challenging. First, due to privacy concerns, pathology images are usually not publicly available. Second, only well-trained pathologists can understand pathology images, but they barely have time to help create datasets for AI research. To address these challenges, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning
Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide)
