WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole-Slide Images

Pingyi Chen; Honglin Li; Chenglu Zhu; Sunyi Zheng; Zhongyi Shui; Lin Yang

arXiv:2311.16480 · cs.CV · June 28, 2024 · 2 cites

TL;DR

This paper introduces MI-Gen, a multiple instance generative model that produces pathology reports from gigapixel whole-slide images, together with PathText, a curated dataset of nearly 10,000 WSI-text pairs drawn from TCGA.

Contribution

We present a new large-scale WSI-text dataset (PathText) and a multiple instance generative model (MI-Gen) for automatic pathology report generation from gigapixel images.

Findings

01. The model produces reports that contain multiple clinical clues.

02. The model achieves competitive performance on slide-level tasks.

03. Semantic extraction from the reports surpasses previous methods in BRCA subtyping.

Abstract

Whole slide images are the foundation of digital pathology for the diagnosis and treatment of carcinomas. Writing pathology reports is laborious and error-prone for inexperienced pathologists. To reduce the workload and improve clinical automation, we investigate how to generate pathology reports given whole slide images. On the data end, we curated the largest WSI-text dataset (PathText). Specifically, we collected nearly 10,000 high-quality WSI-text pairs for visual-language models by recognizing and cleaning pathology reports that narrate diagnostic slides in TCGA. On the model end, we propose the multiple instance generative model (MI-Gen), which can produce pathology reports for gigapixel WSIs. We benchmark our model on the largest subset of TCGA-PathoText. Experimental results show that our model can generate pathology reports which contain multiple clinical clues and achieve competitive…

Code & Models

Repositories

cpystan/wsi-caption
PyTorch · Official

Taxonomy

Topics: AI in cancer detection · Video Analysis and Summarization · Multimodal Machine Learning Applications