TL;DR
The paper introduces OSA, a novel framework for echocardiography video segmentation that employs orthogonalized state updates and anatomical prior-aware feature enhancement to improve accuracy and stability.
Contribution
It proposes a constrained state evolution on the Stiefel manifold and a physics-driven feature enhancement to address noise and rank collapse in cardiac video segmentation.
Findings
Achieves state-of-the-art accuracy on CAMUS and EchoNet-Dynamic datasets.
Maintains real-time inference efficiency for clinical use.
Improves temporal stability and robustness against speckle noise.
Abstract
Accurate and temporally consistent segmentation of the left ventricle from echocardiography videos is essential for estimating the ejection fraction and assessing cardiac function. However, modeling spatiotemporal dynamics remains difficult due to severe speckle noise and rapid non-rigid deformations. Existing linear recurrent models offer efficient in-context associative recall for temporal tracking, but rely on unconstrained state updates, which cause progressive singular value decay in the state matrix, a phenomenon known as rank collapse, resulting in anatomical details being overwhelmed by noise. To address this, we propose OSA, a framework that constrains the state evolution on the Stiefel manifold. We introduce the Orthogonalized State Update (OSU) mechanism, which formulates the memory evolution as Euclidean projected gradient descent on the Stiefel manifold to prevent rank…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
