Full-Scale Indexing and Semantic Annotation of CT Imaging: Boosting FAIRness
Hannes Ulrich, Robin Hendel, Santiago Pazmino, Bj\"orn Bergh, Bj\"orn, Schreiweis

TL;DR
This paper presents a comprehensive method for indexing and semantically annotating large-scale CT imaging datasets to enhance their findability, interoperability, and reusability, thereby improving data FAIRness for AI applications.
Contribution
It introduces an automated process that semantically enriches CT images with SNOMED CT annotations and standardizes metadata with HL7 FHIR, enabling large-scale, interoperable medical imaging data integration.
Findings
Semantic enrichment of over 230,000 CT series
Integration of 8 million SNOMED CT annotations
Enhanced data discoverability and interoperability
Abstract
Background: The integration of artificial intelligence into medicine has led to significant advances, particularly in diagnostics and treatment planning. However, the reliability of AI models is highly dependent on the quality of the training data, especially in medical imaging, where varying patient data and evolving medical knowledge pose a challenge to the accuracy and generalizability of given datasets. Results: The proposed approach focuses on the integration and enhancement of clinical computed tomography (CT) image series for better findability, accessibility, interoperability, and reusability. Through an automated indexing process, CT image series are semantically enhanced using the TotalSegmentator framework for segmentation and resulting SNOMED CT annotations. The metadata is standardized with HL7 FHIR resources to enable efficient data recognition and data exchange between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Research Data Management Practices · Radiomics and Machine Learning in Medical Imaging
