Multi-instance Learning as Downstream Task of Self-Supervised Learning-based Pre-trained Model

Koki Matsuishi; Tsuyoshi Okita

arXiv:2505.21564·cs.CV·May 29, 2025

Multi-instance Learning as Downstream Task of Self-Supervised Learning-based Pre-trained Model

Koki Matsuishi, Tsuyoshi Okita

PDF

Open Access

TL;DR

This paper proposes using a self-supervised pre-trained model as a downstream task for deep multi-instance learning, significantly improving accuracy and F1 scores in brain hematoma CT image classification, especially with large instance bags.

Contribution

It introduces a novel approach of leveraging self-supervised pre-trained models to enhance multi-instance learning performance on large instance datasets.

Findings

01

Achieved 5-13% accuracy improvement.

02

Achieved 40-55% F1 score improvement.

03

Effective in addressing spurious correlation issues.

Abstract

In deep multi-instance learning, the number of applicable instances depends on the data set. In histopathology images, deep learning multi-instance learners usually assume there are hundreds to thousands instances in a bag. However, when the number of instances in a bag increases to 256 in brain hematoma CT, learning becomes extremely difficult. In this paper, we address this drawback. To overcome this problem, we propose using a pre-trained model with self-supervised learning for the multi-instance learner as a downstream task. With this method, even when the original target task suffers from the spurious correlation problem, we show improvements of 5% to 13% in accuracy and 40% to 55% in the F1 measure for the hypodensity marker classification of brain hematoma CT.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · AI in cancer detection · Digital Imaging for Blood Diseases