Focal Modulation and Bidirectional Feature Fusion Network for Medical Image Segmentation

Moin Safdar; Shahzaib Iqbal; Mubeen Ghafoor; Tariq M.Khan; Imran Razzak; Thantrira Porntaveetus; and Hamid Alinejad-Rokny

arXiv:2510.20933·cs.CV·May 1, 2026

Focal Modulation and Bidirectional Feature Fusion Network for Medical Image Segmentation

Moin Safdar, Shahzaib Iqbal, Mubeen Ghafoor, Tariq M.Khan, Imran Razzak, Thantrira Porntaveetus, and Hamid Alinejad-Rokny

PDF

TL;DR

This paper introduces FM-BFF-Net, a hybrid CNN-transformer model with focal modulation and bidirectional feature fusion, achieving superior accuracy in diverse medical image segmentation tasks.

Contribution

It proposes a novel network architecture combining focal modulation and bidirectional feature fusion to improve boundary accuracy and robustness in medical image segmentation.

Findings

01

Outperforms state-of-the-art methods in Jaccard index and Dice coefficient across multiple datasets.

02

Enhances boundary precision and handles variations in lesion size, shape, and contrast.

03

Demonstrates effectiveness on polyp detection, skin lesion segmentation, and ultrasound imaging.

Abstract

Medical image segmentation is essential for clinical applications such as disease diagnosis, treatment planning, and disease development monitoring because it provides precise morphological and spatial information on anatomical structures that directly influence treatment decisions. Convolutional neural networks significantly impact image segmentation; however, since convolution operations are local, capturing global contextual information and long-range dependencies is still challenging. Their capacity to precisely segment structures with complicated borders and a variety of sizes is impacted by this restriction. Since transformers use self-attention methods to capture global context and long-range dependencies efficiently, integrating transformer-based architecture with CNNs is a feasible approach to overcoming these challenges. To address these challenges, we propose the Focal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.