# HSSAM-Net: hyper-scale shifted aggregation network for precise colorectal polyp segmentation in endoscopic images

**Authors:** Qing Feng, Shahzad Ahmed, Yueming Zhang, Lan He, Muhammad Yaqub

PMC · DOI: 10.1038/s41598-025-21954-y · Scientific Reports · 2025-10-31

## TL;DR

This paper introduces HSSAM-Net, a deep learning model that improves colorectal polyp detection in colonoscopy images with high accuracy and real-time performance.

## Contribution

The novel HSSAM-Net framework combines a Hyper-Scale Shifted Aggregation Module and Max-Diagonal Pooling for efficient and accurate polyp segmentation.

## Key findings

- HSSAM-Net achieves state-of-the-art accuracy (Dice: 0.949–0.952, mIoU: 0.924–0.930) on five benchmark datasets.
- The model operates at 24.1 FPS with only 0.9 million parameters, making it suitable for real-time clinical use.
- It outperforms existing methods while maintaining efficiency and boundary refinement.

## Abstract

Colorectal cancer remains a leading cause of cancer-related mortality worldwide, emphasizing the importance of early detection through accurate polyp identification. However, colonoscopy relies heavily on precise polyp segmentation in endoscopic images, yet this task remains challenging due to morphological variability, low contrast, and imaging artifacts. In this study, we propose HSSAM-Net, a lightweight deep learning framework that integrates a Hyper-Scale Shifted Aggregation Module to capture multi-scale contextual information while preserving fine-grained details, Progressive Reuse Attention mechanism that strengthens feature propagation across the encoder-decoder pathway, and Max-Diagonal Pooling/Unpooling (MaxDP/MaxDUP) a novel dual-branch sampling scheme to improve texture representation, feature alignment to enhance feature aggregation, context learning, and boundary refinement. The proposed model is evaluated on five benchmark datasets (Kvasir, CVC-ClinicDB, ETIS, CVC-300, EndoCV2020). Experimental results show that HSSAM-Net consistently outperforms state-of-the-art methods across benchmark datasets, HSSAM-Net consistently achieves state-of-the-art accuracy (Dice: 0.949–0.952, mIoU: 0.924–0.930), while maintaining real-time efficiency at 24.1 FPS with only 0.9 M parameters. Furthermore, an analysis of trainable parameters and inference speed confirms its suitability for real-time clinical applications. Our findings demonstrate that HSSAM-Net achieves a favorable trade-off between accuracy and efficiency, advancing the development of practical and reliable computer-aided colonoscopy systems.

The online version contains supplementary material available at 10.1038/s41598-025-21954-y.

## Linked entities

- **Diseases:** colorectal cancer (MONDO:0005575)

## Full-text entities

- **Diseases:** colorectal polyp (MESH:D003111), Colorectal cancer (MESH:D015179), polyp (MESH:D011127), cancer (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12578911/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12578911/full.md

## References

17 references — full list in the complete paper: https://tomesphere.com/paper/PMC12578911/full.md

---
Source: https://tomesphere.com/paper/PMC12578911