WSC-Trans: A 3D network model for automatic multi-structural segmentation of temporal bone CT
Xin Hua, Zhijiang Du, Hongjian Yu, Jixin Ma, Fanjun Zheng, Cheng, Zhang, Qiaohui Lu, Hui Zhao

TL;DR
This paper introduces a deep learning 3D network model that automatically segments complex small structures in temporal bone CT scans, significantly improving accuracy and efficiency for surgical planning in cochlear implantation.
Contribution
The study presents a novel combination of CNN and Transformer with attention mechanisms for multi-structure segmentation in temporal bone CT, outperforming existing methods.
Findings
Higher dice similarity scores for all structures
Lower HD95 and ASSD scores indicating better accuracy
Outperforms existing segmentation algorithms
Abstract
Cochlear implantation is currently the most effective treatment for patients with severe deafness, but mastering cochlear implantation is extremely challenging because the temporal bone has extremely complex and small three-dimensional anatomical structures, and it is important to avoid damaging the corresponding structures when performing surgery. The spatial location of the relevant anatomical tissues within the target area needs to be determined using CT prior to the procedure. Considering that the target structures are too small and complex, the time required for manual segmentation is too long, and it is extremely challenging to segment the temporal bone and its nearby anatomical structures quickly and accurately. To overcome this difficulty, we propose a deep learning-based algorithm, a 3D network model for automatic segmentation of multi-structural targets in temporal bone CT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFacial Nerve Paralysis Treatment and Research · Ear Surgery and Otitis Media · Hearing Loss and Rehabilitation
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Softmax · Adam · Absolute Position Encodings · Byte Pair Encoding
