Exploring visual language models as a powerful tool in the diagnosis of   Ewing Sarcoma

Alvaro Pastor-Naranjo; Pablo Meseguer; Roc\'io del Amor; Jose Antonio; Lopez-Guerrero; Samuel Navarro; Katia Scotlandi; Antonio Llombart-Bosch,; Isidro Machado; Valery Naranjo

arXiv:2501.08042·cs.CV·January 15, 2025

Exploring visual language models as a powerful tool in the diagnosis of Ewing Sarcoma

Alvaro Pastor-Naranjo, Pablo Meseguer, Roc\'io del Amor, Jose Antonio, Lopez-Guerrero, Samuel Navarro, Katia Scotlandi, Antonio Llombart-Bosch,, Isidro Machado, Valery Naranjo

PDF

TL;DR

This paper investigates the use of vision-language models for diagnosing Ewing Sarcoma from histopathological images, demonstrating improved accuracy and efficiency over traditional pre-training methods.

Contribution

It introduces the application of vision-language supervision for ES diagnosis, showing significant accuracy gains and reduced computational costs compared to standard methods.

Findings

01

VLS improves diagnostic accuracy for ES.

02

VLS reduces model complexity and training costs.

03

In-domain VLS models outperform ImageNet pre-trained models.

Abstract

Ewing's sarcoma (ES), characterized by a high density of small round blue cells without structural organization, presents a significant health concern, particularly among adolescents aged 10 to 19. Artificial intelligence-based systems for automated analysis of histopathological images are promising to contribute to an accurate diagnosis of ES. In this context, this study explores the feature extraction ability of different pre-training strategies for distinguishing ES from other soft tissue or bone sarcomas with similar morphology in digitized tissue microarrays for the first time, as far as we know. Vision-language supervision (VLS) is compared to fully-supervised ImageNet pre-training within a multiple instance learning paradigm. Our findings indicate a substantial improvement in diagnostic accuracy with the adaption of VLS using an in-domain dataset. Notably, these models not only…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.