Revisiting Vicinal Risk Minimization for Partially Supervised Multi-Label Classification Under Data Scarcity
Nanqing Dong, Jiayi Wang, Irina Voiculescu

TL;DR
This paper introduces the first vicinal risk minimization approach for partially supervised multi-label classification in medical imaging, addressing data scarcity and improving model generalization.
Contribution
It presents a novel VRM-based method specifically designed for PSMLC, filling a gap in existing research and demonstrating its effectiveness.
Findings
VRM improves generalization in PSMLC tasks
Partially labeled data negatively impacts model performance
Augmenting partial labels can mitigate data scarcity issues
Abstract
Due to the high human cost of annotation, it is non-trivial to curate a large-scale medical dataset that is fully labeled for all classes of interest. Instead, it would be convenient to collect multiple small partially labeled datasets from different matching sources, where the medical images may have only been annotated for a subset of classes of interest. This paper offers an empirical understanding of an under-explored problem, namely partially supervised multi-label classification (PSMLC), where a multi-label classifier is trained with only partially labeled medical images. In contrast to the fully supervised counterpart, the partial supervision caused by medical data scarcity has non-trivial negative impacts on the model performance. A potential remedy could be augmenting the partial labels. Though vicinal risk minimization (VRM) has been a promising solution to improve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer-related molecular mechanisms research · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
