PathInsight: Instruction Tuning of Multimodal Datasets and Models for   Intelligence Assisted Diagnosis in Histopathology

Xiaomin Wu; Rui Xu; Pengchen Wei; Wenkang Qin; Peixiang Huang; Ziheng; Li; Lin Luo

arXiv:2408.07037·cs.CV·August 14, 2024·2 cites

PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology

Xiaomin Wu, Rui Xu, Pengchen Wei, Wenkang Qin, Peixiang Huang, Ziheng, Li, Lin Luo

PDF

Open Access

TL;DR

PathInsight introduces a large, meticulously curated multimodal dataset and fine-tuned models for improved diagnosis in histopathology, aiming to bridge the gap between advanced AI models and clinical application.

Contribution

The paper presents a new extensive dataset of 45,000 cases and fine-tuned multimodal models tailored for pathological diagnosis tasks.

Findings

01

Fine-tuned models perform well on image captioning and classification tasks.

02

Models show proficiency in addressing typical pathological questions.

03

Public release of models and datasets to aid research.

Abstract

Pathological diagnosis remains the definitive standard for identifying tumors. The rise of multimodal large models has simplified the process of integrating image analysis with textual descriptions. Despite this advancement, the substantial costs associated with training and deploying these complex multimodal models, together with a scarcity of high-quality training datasets, create a significant divide between cutting-edge technology and its application in the clinical setting. We had meticulously compiled a dataset of approximately 45,000 cases, covering over 6 different tasks, including the classification of organ tissues, generating pathology report descriptions, and addressing pathology-related questions and answers. We have fine-tuned multimodal large models, specifically LLaVA, Qwen-VL, InternLM, with this dataset to enhance instruction-based performance. We conducted a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection

MethodsBalanced Selection