Hierarchical Spatio-temporal Segmentation Network for Ejection Fraction Estimation in Echocardiography Videos
Dongfang Wang, Jian Yang, Yizhe Zhang, Tao Zhou

TL;DR
This paper introduces a hierarchical spatio-temporal segmentation network that enhances ejection fraction estimation in echocardiography videos by combining local detail preservation with global dynamic analysis.
Contribution
It proposes a novel hierarchical network architecture with a spatio-temporal cross scan module to improve EF estimation accuracy in noisy ultrasound videos.
Findings
Improved EF estimation accuracy demonstrated on echocardiography datasets.
Effective integration of local and global features reduces noise impact.
Hierarchical design balances detail preservation with dynamic context understanding.
Abstract
Automated segmentation of the left ventricular endocardium in echocardiography videos is a key research area in cardiology. It aims to provide accurate assessment of cardiac structure and function through Ejection Fraction (EF) estimation. Although existing studies have achieved good segmentation performance, their results do not perform well in EF estimation. In this paper, we propose a Hierarchical Spatio-temporal Segmentation Network (\ourmodel) for echocardiography video, aiming to improve EF estimation accuracy by synergizing local detail modeling with global dynamic perception. The network employs a hierarchical design, with low-level stages using convolutional networks to process single-frame images and preserve details, while high-level stages utilize the Mamba architecture to capture spatio-temporal relationships. The hierarchical design balances single-frame and multi-frame…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
