Large Multimodal Model based Standardisation of Pathology Reports with Confidence and their Prognostic Significance
Ethar Alzaid, Gabriele Pergola, Harriet Evans, David Snead, Fayyaz, Minhas

TL;DR
This paper introduces a large multimodal model framework that automatically extracts and standardizes information from pathology reports, providing confidence scores and demonstrating prognostic significance for patient stratification.
Contribution
It presents a novel two-stage prompting approach using LMMs for extracting and validating report data with confidence scores, applicable to diverse report formats.
Findings
Confidence scores effectively indicate extraction accuracy.
The extracted data has significant prognostic value.
Framework generalizes across multiple medical centers.
Abstract
Pathology reports are rich in clinical and pathological details but are often presented in free-text format. The unstructured nature of these reports presents a significant challenge limiting the accessibility of their content. In this work, we present a practical approach based on the use of large multimodal models (LMMs) for automatically extracting information from scanned images of pathology reports with the goal of generating a standardised report specifying the value of different fields along with estimated confidence about the accuracy of the extracted fields. The proposed approach overcomes limitations of existing methods which do not assign confidence scores to extracted fields limiting their practical use. The proposed framework uses two stages of prompting a Large Multimodal Model (LMM) for information extraction and validation. The framework generalises to textual reports…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection · Biomedical Text Mining and Ontologies
