Knowledge-driven AI-generated data for accurate and interpretable breast ultrasound diagnoses
Haojun Yu, Youcheng Li, Nan Zhang, Zihan Niu, Xuantong Gong, Yanwen, Luo, Quanlin Wu, Wangyan Qin, Mengyuan Zhou, Jie Han, Jia Tao, Ziwei Zhao, Di, Dai, Di He, Dong Wang, Binghui Tang, Ling Huo, Qingli Zhu, Yong Wang and, Liwei Wang

TL;DR
This paper introduces TAILOR, a knowledge-driven generative pipeline that creates synthetic breast ultrasound data to improve diagnosis accuracy, especially for rare cases, outperforming radiologists in specificity and interpretability.
Contribution
The study presents a novel knowledge-driven generative model that produces tailored synthetic data to enhance deep learning diagnosis of rare breast ultrasound cases.
Findings
Model outperforms nine radiologists in specificity by 33.5%.
Generated data significantly improves diagnosis accuracy for rare cases.
Model excels in diagnosing ductal carcinoma in situ (DCIS).
Abstract
Data-driven deep learning models have shown great capabilities to assist radiologists in breast ultrasound (US) diagnoses. However, their effectiveness is limited by the long-tail distribution of training data, which leads to inaccuracies in rare cases. In this study, we address a long-standing challenge of improving the diagnostic model performance on rare cases using long-tailed data. Specifically, we introduce a pipeline, TAILOR, that builds a knowledge-driven generative model to produce tailored synthetic data. The generative model, using 3,749 lesions as source data, can generate millions of breast-US images, especially for error-prone rare cases. The generated data can be further used to build a diagnostic model for accurate and interpretable diagnoses. In the prospective external evaluation, our diagnostic model outperforms the average performance of nine radiologists by 33.5% in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging
