WSC-Trans: A 3D network model for automatic multi-structural   segmentation of temporal bone CT

Xin Hua; Zhijiang Du; Hongjian Yu; Jixin Ma; Fanjun Zheng; Cheng; Zhang; Qiaohui Lu; Hui Zhao

arXiv:2211.07143·eess.IV·November 15, 2022

WSC-Trans: A 3D network model for automatic multi-structural segmentation of temporal bone CT

Xin Hua, Zhijiang Du, Hongjian Yu, Jixin Ma, Fanjun Zheng, Cheng, Zhang, Qiaohui Lu, Hui Zhao

PDF

Open Access

TL;DR

This paper introduces a deep learning 3D network model that automatically segments complex small structures in temporal bone CT scans, significantly improving accuracy and efficiency for surgical planning in cochlear implantation.

Contribution

The study presents a novel combination of CNN and Transformer with attention mechanisms for multi-structure segmentation in temporal bone CT, outperforming existing methods.

Findings

01

Higher dice similarity scores for all structures

02

Lower HD95 and ASSD scores indicating better accuracy

03

Outperforms existing segmentation algorithms

Abstract

Cochlear implantation is currently the most effective treatment for patients with severe deafness, but mastering cochlear implantation is extremely challenging because the temporal bone has extremely complex and small three-dimensional anatomical structures, and it is important to avoid damaging the corresponding structures when performing surgery. The spatial location of the relevant anatomical tissues within the target area needs to be determined using CT prior to the procedure. Considering that the target structures are too small and complex, the time required for manual segmentation is too long, and it is extremely challenging to segment the temporal bone and its nearby anatomical structures quickly and accurately. To overcome this difficulty, we propose a deep learning-based algorithm, a 3D network model for automatic segmentation of multi-structural targets in temporal bone CT…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFacial Nerve Paralysis Treatment and Research · Ear Surgery and Otitis Media · Hearing Loss and Rehabilitation

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Softmax · Adam · Absolute Position Encodings · Byte Pair Encoding