Landmark Detection using Transformer Toward Robot-assisted Nasal Airway Intubation
Tianhang Liu, Hechen Li, Long Bai, Yanan Wu, An Wang, Mobarakol Islam,, Hongliang Ren

TL;DR
This paper introduces a transformer-based method for accurate landmark detection in robot-assisted nasal intubation, improving localization of nostrils and glottis with a deformable DeTR and semantic alignment.
Contribution
It proposes a novel transformer-based landmark detection approach with semantic alignment, tailored for small object detection in medical robotic procedures.
Findings
Achieved high detection accuracy on a public glottis dataset.
Successfully annotated and detected nostrils using the proposed method.
Code is publicly available for reproducibility.
Abstract
Robot-assisted airway intubation application needs high accuracy in locating targets and organs. Two vital landmarks, nostrils and glottis, can be detected during the intubation to accommodate the stages of nasal intubation. Automated landmark detection can provide accurate localization and quantitative evaluation. The Detection Transformer (DeTR) leads object detectors to a new paradigm with long-range dependence. However, current DeTR requires long iterations to converge, and does not perform well in detecting small objects. This paper proposes a transformer-based landmark detection solution with deformable DeTR and the semantic-aligned-matching module for detecting landmarks in robot-assisted intubation. The semantics aligner can effectively align the semantics of object queries and image features in the same embedding space using the most discriminative features. To evaluate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHead and Neck Surgical Oncology · Nasal Surgery and Airway Studies · Sinusitis and nasal conditions
MethodsMulti-Head Attention · Attention Is All You Need · Feedforward Network · Linear Layer · Adam · Dense Connections · Label Smoothing · Residual Connection · Dropout · Convolution
