Geometrical Cross-Attention and Nonvoid Voxelization for Efficient 3D Medical Image Segmentation

Chenxin Yuan; Shoupeng Chen; Haojiang Ye; Yiming Miao; Limei Peng; Pin-Han Ho

arXiv:2604.05515·cs.CV·April 8, 2026

Geometrical Cross-Attention and Nonvoid Voxelization for Efficient 3D Medical Image Segmentation

Chenxin Yuan, Shoupeng Chen, Haojiang Ye, Yiming Miao, Limei Peng, Pin-Han Ho

PDF

TL;DR

GCNV-Net is a novel 3D medical image segmentation framework that combines geometrical cross-attention, dynamic voxel partitioning, and nonvoid voxelization to achieve high accuracy with reduced computational cost.

Contribution

The paper introduces GCNV-Net, integrating 3DNVT, GCA, and nonvoid voxelization for efficient and accurate 3D medical segmentation across diverse datasets.

Findings

01

Achieves state-of-the-art performance on multiple benchmarks.

02

Reduces FLOPs by 56.13% and inference latency by 68.49%.

03

Outperforms existing methods in Dice, IoU, NSD, and HD95 metrics.

Abstract

Accurate segmentation of 3D medical scans is crucial for clinical diagnostics and treatment planning, yet existing methods often fail to achieve both high accuracy and computational efficiency across diverse anatomies and imaging modalities. To address these challenges, we propose GCNV-Net, a novel 3D medical segmentation framework that integrates a Tri-directional Dynamic Nonvoid Voxel Transformer (3DNVT), a Geometrical Cross-Attention module (GCA), and Nonvoid Voxelization. The 3DNVT dynamically partitions relevant voxels along the three orthogonal anatomical planes, namely the transverse, sagittal, and coronal planes, enabling effective modeling of complex 3D spatial dependencies. The GCA mechanism explicitly incorporates geometric positional information during multi-scale feature fusion, significantly enhancing fine-grained anatomical segmentation accuracy. Meanwhile, Nonvoid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.