TASL-Net: Tri-Attention Selective Learning Network for Intelligent Diagnosis of Bimodal Ultrasound Video
Chengqian Zhao, Zhao Yao, Zhaoyu Hu, Yuanxin Xie, Yafang Zhang,, Yuanyuan Wang, Shuo Li, Jianhua Zhou, Jianqiao Zhou, Yin Wang, Jinhua Yu

TL;DR
TASL-Net is a deep learning framework that mimics sonographers' attention mechanisms to improve bimodal ultrasound video diagnosis by integrating temporal, spatial, and bimodal attentions through a mutual transformer approach.
Contribution
The paper introduces TASL-Net, a novel network that embeds sonographers' diagnostic attention into a mutual transformer framework for bimodal ultrasound video analysis.
Findings
Achieves superior diagnostic accuracy on lung, breast, and liver datasets.
Effectively mimics sonographers' attention to improve interpretability.
Reduces computational redundancy with a novel video selector.
Abstract
In the intelligent diagnosis of bimodal (gray-scale and contrast-enhanced) ultrasound videos, medical domain knowledge such as the way sonographers browse videos, the particular areas they emphasize, and the features they pay special attention to, plays a decisive role in facilitating precise diagnosis. Embedding medical knowledge into the deep learning network can not only enhance performance but also boost clinical confidence and reliability of the network. However, it is an intractable challenge to automatically focus on these person- and disease-specific features in videos and to enable networks to encode bimodal information comprehensively and efficiently. This paper proposes a novel Tri-Attention Selective Learning Network (TASL-Net) to tackle this challenge and automatically embed three types of diagnostic attention of sonographers into a mutual transformer framework for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Technology and Pedagogy · Advanced Technologies in Various Fields · Human auditory perception and evaluation
MethodsSoftmax · Attention Is All You Need · Focus · Convolution
