PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
Yutong Xie, Qi Chen, Sinuo Wang, Minh-Son To, Iris Lee, Ee Win Khoo,, Kerolos Hendy, Daniel Koh, Yong Xia, Qi Wu

TL;DR
PairAug introduces a dual-branch data augmentation framework that simultaneously enhances medical image and text datasets in radiology, significantly improving vision-language pre-training performance.
Contribution
The paper proposes a novel PairAug framework with inter- and intra-patient augmentation branches, enabling concurrent image and text data augmentation using LLM-generated reports.
Findings
Outperforms image-only and text-only augmentation methods.
Enhances zero-shot and fine-tuning tasks in medical image classification.
Demonstrates significant improvements over existing VLP baselines.
Abstract
Current vision-language pre-training (VLP) methodologies predominantly depend on paired image-text datasets, a resource that is challenging to acquire in radiology due to privacy considerations and labelling complexities. Data augmentation provides a practical solution to overcome the issue of data scarcity, however, most augmentation methods exhibit a limited focus, prioritising either image or text augmentation exclusively. Acknowledging this limitation, our objective is to devise a framework capable of concurrently augmenting medical image and text data. We design a Pairwise Augmentation (PairAug) approach that contains an Inter-patient Augmentation (InterAug) branch and an Intra-patient Augmentation (IntraAug) branch. Specifically, the InterAug branch of our approach generates radiology images using synthesised yet plausible reports derived from a Large Language Model (LLM). The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Radiology practices and education · Colorectal Cancer Screening and Detection
