Standardizing Medical Images at Scale for AI
Callen MacPhee, Yiming Zhou, Koichiro Kishima, Bahram Jalali

TL;DR
This paper introduces PhyCV, a physics-based preprocessing framework that standardizes medical images to reduce domain shifts, significantly improving AI model accuracy and robustness across diverse clinical datasets.
Contribution
The paper presents a novel, physics-inspired image standardization method that enhances model generalization in medical imaging by modeling optical physics for deterministic preprocessing.
Findings
Improved breast-cancer classification accuracy from 70.8% to 90.9%.
Matches or exceeds data-augmentation and domain-generalization methods.
Low computational cost and high interpretability.
Abstract
Deep learning has achieved remarkable success in medical image analysis, yet its performance remains highly sensitive to the heterogeneity of clinical data. Differences in imaging hardware, staining protocols, and acquisition conditions produce substantial domain shifts that degrade model generalization across institutions. Here we present a physics-based data preprocessing framework based on the PhyCV (Physics-Inspired Computer Vision) family of algorithms, which standardizes medical images through deterministic transformations derived from optical physics. The framework models images as spatially varying optical fields that undergo a virtual diffractive propagation followed by coherent phase detection. This process suppresses non-semantic variability such as color and illumination differences while preserving diagnostically relevant texture and structural features. When applied to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Digital Holography and Microscopy · Advanced X-ray Imaging Techniques
