HYATT-Net is Grand: A Hybrid Attention Network for Performant Anatomical Landmark Detection
Xiaoqian Zhou, Zhen Huang, Heqin Zhu, Qingsong Yao, S. Kevin Zhou

TL;DR
HYATT-Net is a hybrid CNN-Transformer model designed for accurate and efficient anatomical landmark detection in medical images, achieving state-of-the-art results on five datasets.
Contribution
The paper introduces HYATT-Net, a novel hybrid architecture that combines CNNs and Transformers with specialized modules to improve anatomical landmark detection (ALD) performance.
Findings
Achieves state-of-the-art accuracy on five datasets.
Demonstrates robustness and efficiency in complex medical images.
Outperforms existing methods in precision and computational cost.
Abstract
Anatomical landmark detection (ALD) from a medical image is crucial for a wide array of clinical applications. While existing methods have achieved considerable success in ALD, they often struggle to balance global context with computational efficiency, particularly on high-resolution images. This raises a natural question: where is the performance limit of ALD? In this paper, we aim to forge performant ALD by proposing a HYbrid ATTention Network (HYATT-Net) with the following designs: (i) A novel hybrid architecture that integrates CNNs and Transformers. Its core is the BiFormer module, which uses Bi-Level Routing Attention to attend efficiently to relevant image regions. Combined with the Attention Residual Module (ARM), this enables precise local feature refinement guided by the global context. (ii) A Feature Fusion Correction Module that aggregates…
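The abstract names Bi-Level Routing Attention as the core of the BiFormer module. The following is a minimal NumPy sketch of the general idea only, not the authors' implementation: tokens are assumed to be already grouped into regions, each region first picks its `topk` most relevant regions from averaged region descriptors, and token-level attention then runs only over the tokens of those routed regions. The function name `bi_level_routing_attention`, the input layout `(num_regions, tokens_per_region, dim)`, and the parameter `topk` are illustrative assumptions. Learned projections, multiple heads, and any local-context branch are omitted.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def bi_level_routing_attention(q, k, v, topk):
    """Sketch of two-level (region, then token) attention.

    q, k, v: arrays of shape (num_regions, tokens_per_region, dim),
             assumed to be already projected and partitioned into regions.
    topk:    number of regions each query region attends to (assumption).
    Returns the attended output (same shape as q) and the routing indices.
    """
    n_regions, n_tokens, dim = q.shape

    # Level 1 (coarse): describe each region by the mean of its tokens,
    # then score every region pair.
    q_region = q.mean(axis=1)            # (R, d)
    k_region = k.mean(axis=1)            # (R, d)
    affinity = q_region @ k_region.T     # (R, R)

    # Keep only the top-k highest-affinity regions per query region.
    routed = np.argsort(-affinity, axis=1)[:, :topk]   # (R, topk)

    # Level 2 (fine): gather keys/values from the routed regions and run
    # ordinary scaled dot-product attention over just those tokens.
    k_gathered = k[routed].reshape(n_regions, topk * n_tokens, dim)
    v_gathered = v[routed].reshape(n_regions, topk * n_tokens, dim)
    scores = q @ k_gathered.transpose(0, 2, 1) / np.sqrt(dim)  # (R, T, topk*T)
    weights = softmax(scores, axis=-1)
    return weights @ v_gathered, routed


# Usage with random data: 16 regions of 49 tokens each, 32 channels.
rng = np.random.default_rng(0)
q, k, v = (rng.normal(size=(16, 49, 32)) for _ in range(3))
out, routed = bi_level_routing_attention(q, k, v, topk=4)
print(out.shape, routed.shape)
```

In this sketch each query token attends to `topk * tokens_per_region` keys rather than to every token in the image, which is where the efficiency on high-resolution inputs would come from. Setting `topk` equal to the number of regions recovers full attention.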
Taxonomy
Topics: Medical Imaging and Analysis · Anatomy and Medical Technology · Dental Radiography and Imaging
Methods: Softmax · Attention Is All You Need · Sigmoid Activation · Max Pooling · Average Pooling · Dense Connections · Convolution
