3D Axial-Attention for Lung Nodule Classification
Mundher Al-Shabi, Kelvin Shak, Maxine Tan

TL;DR
This paper introduces a 3D Axial-Attention network for lung nodule classification that offers full 3D attention with reduced computational cost, outperforming previous methods on the LIDC-IDRI dataset.
Contribution
It proposes a novel 3D Axial-Attention mechanism with positional encoding, enabling efficient full 3D attention in lung nodule classification.
Findings
Achieved state-of-the-art performance on LIDC-IDRI dataset
Validated effectiveness with rigorous experimental setup
Demonstrated importance of full 3D attention for classification
Abstract
Purpose: In recent years, Non-Local based methods have been successfully applied to lung nodule classification. However, these methods offer 2D attention or limited 3D attention to low-resolution feature maps. Moreover, they still depend on a convenient local filter such as convolution as full 3D attention is expensive to compute and requires a big dataset, which might not be available. Methods: We propose to use 3D Axial-Attention, which requires a fraction of the computing power of a regular Non-Local network (i.e., self-attention). Unlike a regular Non-Local network, the 3D Axial-Attention network applies the attention operation to each axis separately. Additionally, we solve the invariant position problem of the Non-Local network by proposing to add 3D positional encoding to shared embeddings. Results: We validated the proposed method on 442 benign nodules and 406 malignant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
