MedTri: A Platform for Structured Medical Report Normalization to Enhance Vision-Language Pretraining
Yuetan Chu, Xinhua Ma, Xinran Jin, Gongning Luo, Xin Gao

TL;DR
MedTri introduces a structured normalization framework for medical reports that enhances vision-language pretraining by converting reports into anatomy-grounded triplets, improving model robustness and generalization.
Contribution
This work presents MedTri, a deployable platform for normalizing medical reports into structured triplets, systematically demonstrating its benefits for vision-language pretraining.
Findings
Structured normalization improves pretraining quality.
Normalization supports augmentation strategies.
Consistent gains across multiple datasets.
Abstract
Medical vision-language pretraining increasingly relies on medical reports as large-scale supervisory signals; however, raw reports often exhibit substantial stylistic heterogeneity, variable length, and a considerable amount of image-irrelevant content. Although text normalization is frequently adopted as a preprocessing step in prior work, its design principles and empirical impact on vision-language pretraining remain insufficiently and systematically examined. In this study, we present MedTri, a deployable normalization framework for medical vision-language pretraining that converts free-text reports into a unified [Anatomical Entity: Radiologic Description + Diagnosis Category] triplet. This structured, anatomy-grounded normalization preserves essential morphological and spatial information while removing stylistic noise and image-irrelevant content, providing consistent and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Radiology practices and education · Medical Imaging and Analysis
