Preparing data for pathological artificial intelligence with clinical-grade performance
Yuanqing Yang (1), Kai Sun (1), Yanhua Gao (2), Kuangsong Wang (3 and, 4), Gang Yu (1) ((1) Department of Biomedical Engineering, School of Basic, Medical Sciences, Central South University, Changsha, China,(2) Department of, Ultrasound, Shaanxi Provincial People's Hospital

TL;DR
This paper reviews data preparation methods for pathological AI, emphasizing the importance of high-quality, standardized data and weakly supervised learning to achieve clinical-grade diagnostic performance.
Contribution
It provides an in-depth analysis of data preparation techniques and identifies key factors for improving the clinical reproducibility of pathological AI.
Findings
Robust PAI depends on representative, high-quality data
Data standardization and weakly supervised learning improve performance
Multi-center data consistency is crucial for clinical deployment
Abstract
[Purpose] The pathology is decisive for disease diagnosis, but relies heavily on the experienced pathologists. Recently, pathological artificial intelligence (PAI) is thought to improve diagnostic accuracy and efficiency. However, the high performance of PAI based on deep learning in the laboratory generally cannot be reproduced in the clinic. [Methods] Because the data preparation is important for PAI, the paper has reviewed PAI-related studies in the PubMed database published from January 2017 to February 2022, and 118 studies were included. The in-depth analysis of methods for preparing data is performed, including obtaining slides of pathological tissue, cleaning, screening, and then digitizing. Expert review, image annotation, dataset division for model training and validation are also discussed. We further discuss the reasons why the high performance of PAI is not reproducible in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging
