A Foundation Model for General Moving Object Segmentation in Medical   Images

Zhongnuo Yan; Tong Han; Yuhao Huang; Lian Liu; Han Zhou; Jiongquan; Chen; Wenlong Shi; Yan Cao; Xin Yang; Dong Ni

arXiv:2309.17264·cs.CV·February 28, 2024

A Foundation Model for General Moving Object Segmentation in Medical Images

Zhongnuo Yan, Tong Han, Yuhao Huang, Lian Liu, Han Zhou, Jiongquan, Chen, Wenlong Shi, Yan Cao, Xin Yang, Dong Ni

PDF

Open Access 1 Repo

TL;DR

This paper introduces iMOS, a foundation model for moving object segmentation in medical images, which achieves effective segmentation with minimal annotations, potentially accelerating medical image annotation and model development.

Contribution

The paper presents the first foundation model for medical moving object segmentation, enabling accurate segmentation with limited annotations and improving annotation efficiency.

Findings

01

iMOS achieves satisfactory segmentation with minimal annotations.

02

The model effectively tracks moving objects in medical image sequences.

03

Experiments validate the model's robustness across multi-modal datasets.

Abstract

Medical image segmentation aims to delineate the anatomical or pathological structures of interest, playing a crucial role in clinical diagnosis. A substantial amount of high-quality annotated data is crucial for constructing high-precision deep segmentation models. However, medical annotation is highly cumbersome and time-consuming, especially for medical videos or 3D volumes, due to the huge labeling space and poor inter-frame consistency. Recently, a fundamental task named Moving Object Segmentation (MOS) has made significant advancements in natural images. Its objective is to delineate moving objects from the background within image sequences, requiring only minimal annotations. In this paper, we propose the first foundation model, named iMOS, for MOS in medical images. Extensive experiments on a large multi-modal medical dataset validate the effectiveness of the proposed iMOS.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yzhn16/imos
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · COVID-19 diagnosis using AI

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings