MedICaT: A Dataset of Medical Images, Captions, and Textual References

Sanjay Subramanian; Lucy Lu Wang; Sachin Mehta; Ben Bogin; Madeleine; van Zuylen; Sravanthi Parasa; Sameer Singh; Matt Gardner; Hannaneh Hajishirzi

arXiv:2010.06000·cs.CV·October 14, 2020

MedICaT: A Dataset of Medical Images, Captions, and Textual References

Sanjay Subramanian, Lucy Lu Wang, Sachin Mehta, Ben Bogin, Madeleine, van Zuylen, Sravanthi Parasa, Sameer Singh, Matt Gardner, Hannaneh Hajishirzi

PDF

1 Repo 1 Models

TL;DR

MedICaT is a large dataset of medical images with captions and references, enabling new research in figure-text understanding and alignment in biomedical literature.

Contribution

The paper introduces MedICaT, a comprehensive dataset of 217K medical images with annotations, and proposes the subfigure-to-subcaption alignment task for scientific document understanding.

Findings

01

MedICaT enables improved figure-text alignment methods.

02

Inline references enhance image-text matching accuracy.

03

Subfigure-to-subcaption alignment is feasible with the dataset.

Abstract

Understanding the relationship between figures and text is key to scientific document understanding. Medical figures in particular are quite complex, often consisting of several subfigures (75% of figures in our dataset), with detailed text describing their content. Previous work studying figures in scientific papers focused on classifying figure content rather than understanding how images relate to the text. To address challenges in figure retrieval and figure-to-text alignment, we introduce MedICaT, a dataset of medical images in context. MedICaT consists of 217K images from 131K open access biomedical papers, and includes captions, inline references for 74% of figures, and manually annotated subfigures and subcaptions for a subset of figures. Using MedICaT, we introduce the task of subfigure to subcaption alignment in compound figures and demonstrate the utility of inline references…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

allenai/medicat
pytorchOfficial

Models

🤗
SmallBosser/PeFoMed
model· ♡ 2
♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.