ViTaL: A Multimodality Dataset and Benchmark for Multi-pathological Ovarian Tumor Recognition

You Zhou; Lijiang Chen; Guangxia Cui; Wenpei Bai; Yu Guo; Shuchang Lyu; Guangliang Cheng; Qi Zhao

arXiv:2507.04383·eess.IV·July 8, 2025

ViTaL: A Multimodality Dataset and Benchmark for Multi-pathological Ovarian Tumor Recognition

You Zhou, Lijiang Chen, Guangxia Cui, Wenpei Bai, Yu Guo, Shuchang Lyu, Guangliang Cheng, Qi Zhao

PDF

TL;DR

This paper introduces ViTaL, a comprehensive multimodal dataset and a novel neural network model for multi-pathological ovarian tumor recognition, significantly advancing early diagnosis capabilities.

Contribution

The paper provides the first multimodal dataset for ovarian tumor classification and proposes ViTaL-Net with THOAM for effective multi-pathology recognition.

Findings

01

Achieved over 90% accuracy on common ovarian tumor types

02

Overall classification accuracy reached 85%

03

Demonstrated effectiveness of multi-modal data fusion

Abstract

Ovarian tumor, as a common gynecological disease, can rapidly deteriorate into serious health crises when undetected early, thus posing significant threats to the health of women. Deep neural networks have the potential to identify ovarian tumors, thereby reducing mortality rates, but limited public datasets hinder its progress. To address this gap, we introduce a vital ovarian tumor pathological recognition dataset called \textbf{ViTaL} that contains \textbf{V}isual, \textbf{T}abular and \textbf{L}inguistic modality data of 496 patients across six pathological categories. The ViTaL dataset comprises three subsets corresponding to different patient data modalities: visual data from 2216 two-dimensional ultrasound images, tabular data from medical examinations of 496 patients, and linguistic data from ultrasound reports of 496 patients. It is insufficient to merely distinguish between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.