IAUNet: Instance-Aware U-Net
Yaroslav Prytula, Illia Tsiporenko, Ali Zeynalli, Dmytro Fishman

TL;DR
IAUNet introduces a novel query-based U-Net architecture with a lightweight decoder and Transformer refinement, achieving superior performance in biomedical cell instance segmentation and establishing a new benchmark dataset.
Contribution
The paper presents IAUNet, a new query-based U-Net model with a lightweight decoder and Transformer components, enhancing efficiency and accuracy in biomedical instance segmentation.
Findings
Outperforms state-of-the-art models on multiple datasets
Reduces model parameters with a lightweight decoder
Sets a new benchmark with the 2025 Revvity Full Cell Segmentation Dataset
Abstract
Instance segmentation is critical in biomedical imaging to accurately distinguish individual objects like cells, which often overlap and vary in size. Recent query-based methods, where object queries guide segmentation, have shown strong performance. While U-Net has been a go-to architecture in medical image segmentation, its potential in query-based approaches remains largely unexplored. In this work, we present IAUNet, a novel query-based U-Net architecture. The core design features a full U-Net architecture, enhanced by a novel lightweight convolutional Pixel decoder, making the model more efficient and reducing the number of parameters. Additionally, we propose a Transformer decoder that refines object-specific features across multiple scales. Finally, we introduce the 2025 Revvity Full Cell Segmentation Dataset, a unique resource with detailed annotations of overlapping cell…
Peer Reviews
Decision·ICLR 2025 Conference Withdrawn Submission
The key strengths of IAUNet lie in its innovative fusion of query-based instance segmentation with the proven U-Net architecture, specifically optimized for biomedical imaging challenges. Its main advantages are the three core designs: the dual-branch feature processing that effectively handles both mask and instance features separately, the double-center clustering mechanism that enhances object representation through dual queries per instance, and the parallel feature refinement strategy that
1. This work contains the basic typo, that should not appear: • Typo error: 182: display two question marks for reference in the second paragraph in chapter 3 'Model Overview' • Typo line 200: segmentation.n. • Typo line 415: questions mark? • Grammar mistake for sentence in line 239 to line 240 2. I think you lack the comparison with the SOTA cell segmentation methods originally proposed in other cell image modalities. For instance, StarDist(Paper Title: Nuclei Instance Segmentation
1. The incorporation of instance awareness in U-Net architecture for instance segmentation tasks, though not novel, becomes relevant due to U-Net’s widespread popularity in biomedical imaging. 2. The paper introduces Revvity-25, a new brightfield cell images dataset for instance segmentation. Although no specific details about the dataset (such as image statistics or accessibility) are provided, this new dataset could contribute to future benchmarking in biomedical imaging.
1. The main components of IAUNet are not significantly novel, with most of the components being derivative of existing works. For example: i) U-Net for instance segmentation has been explored in prior works such as "Cell Segmentation and Tracking using CNN-Based Distance Predictions and a Graph-Based Matching Strategy", ii) Query-based mechanisms for instance segmentation are well-known from models like DETR, MaskFormer, and Mask2Former, iii) Dual-path decoders and feature decoupling mechanisms
This paper introduce the Revvity Full Cell Segmentation Dataset, which comprises hundreds of images with thousands of manually annotated cell instances.
1. For biomedical instance segmentation, there are already relevant query-based works to on this topic, like [1,2]. However, this paper does not discuss the differences between its approach and theirs in the related works section. Besides, SAM also use query-based decoder. 2. Due to a misunderstanding in the first point, there is a deviation in the motivation described in the abstract. "While U-Net has been a go-to architecture in medical image segmentation, it was neither specifically designed
- Originality: This work proposed a novel 2025 Revvity Full Cell Segmentation Dataset containing hundreds of high-quality labels for future related research. - Significance: This work proposes a relatively novel approach that integrates the U-Net architecture with a Transformer-like structure.
- Regarding the lack of innovation in this paper, the Related Work section lacks analysis of recent work on query-based cell instance segmentation, such as PCTrans (ICCV 2023 Workshop) and so on, and does not highlight the advantages of this work compared to these studies. As a result, the value of this paper to the field is not apparent. - Concerning the results of the experiments, the metric FLOPs are generally about 1-3 times higher than existing works. With the comparison of all AP measures
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · AI in cancer detection · Advanced Neural Network Applications
