BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays
Yang Zhou, Tan Li Hui Faith, Yanyu Xu, Sicong Leng, Xinxing Xu, Yong, Liu, Rick Siow Mong Goh

TL;DR
BenchX is a comprehensive benchmark framework for evaluating and comparing medical vision-language pretraining methods on chest X-ray tasks, promoting standardized assessment and analysis.
Contribution
It introduces a unified benchmark with diverse datasets, standardized protocols, and evaluation procedures for MedVLP methods on chest X-ray tasks.
Findings
Some early MedVLP methods outperform recent ones when properly benchmarked.
Benchmarking reveals the need to revisit previous conclusions in MedVLP research.
Standardized evaluation can guide future improvements in MedVLP models.
Abstract
Medical Vision-Language Pretraining (MedVLP) shows promise in learning generalizable and transferable visual representations from paired and unpaired medical images and reports. MedVLP can provide useful features to downstream tasks and facilitate adapting task-specific models to new setups using fewer examples. However, existing MedVLP methods often differ in terms of datasets, preprocessing, and finetuning implementations. This pose great challenges in evaluating how well a MedVLP method generalizes to various clinically-relevant tasks due to the lack of unified, standardized, and comprehensive benchmark. To fill this gap, we propose BenchX, a unified benchmark framework that enables head-to-head comparison and systematical analysis between MedVLP methods using public chest X-ray datasets. Specifically, BenchX is composed of three components: 1) Comprehensive datasets covering nine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsImage Retrieval and Classification Techniques · Biomedical Text Mining and Ontologies · Lung Cancer Diagnosis and Treatment
