Medical Image Deidentification, Cleaning and Compression Using Pylogik
Adrienne Kline, Vinesh Appadurai, Yuan Luo, Sanjiv Shah

TL;DR
PyLogik is a Python library that de-identifies, compresses, and prepares ultrasound images for data sharing and deep learning, achieving high accuracy and significant file size reduction.
Contribution
The paper introduces PyLogik, a novel Python-based tool for ultrasound image de-identification, cleaning, and compression, with demonstrated effectiveness and adaptability to other medical images.
Findings
Average Dice coefficient of 0.976 for de-identification accuracy
Approximately 72% reduction in image file size
Effective processing of ultrasound data for machine learning
Abstract
Leveraging medical record information in the era of big data and machine learning comes with the caveat that data must be cleaned and de-identified. Facilitating data sharing and harmonization for multi-center collaborations are particularly difficult when protected health information (PHI) is contained or embedded in image meta-data. We propose a novel library in the Python framework, called PyLogik, to help alleviate this issue for ultrasound images, which are particularly challenging because of the frequent inclusion of PHI directly on the images. PyLogik processes the image volumes through a series of text detection/extraction, filtering, thresholding, morphological and contour comparisons. This methodology de-identifies the images, reduces file sizes, and prepares image volumes for applications in deep learning and data sharing. To evaluate its effectiveness in processing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection · Artificial Intelligence in Healthcare and Education
MethodsLib
