GEMTrans: A General, Echocardiography-based, Multi-Level Transformer Framework for Cardiovascular Diagnosis
Masoud Mokhtari, Neda Ahmadi, Teresa S. M. Tsang, Purang Abolmaesumi,, Renjie Liao

TL;DR
GEMTrans is a versatile transformer-based framework for echocardiography that enhances cardiovascular diagnosis by providing explainability and effectively processing multiple echo videos for tasks like ejection fraction estimation and aortic stenosis detection.
Contribution
It introduces a multi-level transformer model that captures intra- and inter-video relationships, offering explainability and improved accuracy across multiple diagnostic tasks.
Findings
Achieves 4.15 and 4.84 MAE in EF estimation with single and dual videos.
96.5% accuracy in aortic stenosis detection.
Provides task-specific attention maps for explainability.
Abstract
Echocardiography (echo) is an ultrasound imaging modality that is widely used for various cardiovascular diagnosis tasks. Due to inter-observer variability in echo-based diagnosis, which arises from the variability in echo image acquisition and the interpretation of echo images based on clinical experience, vision-based machine learning (ML) methods have gained popularity to act as secondary layers of verification. For such safety-critical applications, it is essential for any proposed ML method to present a level of explainability along with good accuracy. In addition, such methods must be able to process several echo videos obtained from various heart views and the interactions among them to properly produce predictions for a variety of cardiovascular measurements or interpretation tasks. Prior work lacks explainability or is limited in scope by focusing on a single cardiovascular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCardiac Valve Diseases and Treatments · Cardiac Imaging and Diagnostics · Phonocardiography and Auscultation Techniques
MethodsAttention Is All You Need · Linear Layer · Dropout · Byte Pair Encoding · Adam · Position-Wise Feed-Forward Layer · Multi-Head Attention · Absolute Position Encodings · Residual Connection · Label Smoothing
