TL;DR
This paper introduces an asymmetric 3D context fusion operator (A3D) that enhances 3D medical image analysis by better capturing 3D context, outperforming existing symmetric methods without high computational costs.
Contribution
The paper proposes a novel asymmetric 3D context fusion operator (A3D) that improves lesion detection accuracy by leveraging asymmetric fusion of 3D context from 2D slices.
Findings
A3D outperforms symmetric fusion operators on DeepLesion.
A3D establishes a new state-of-the-art in universal lesion detection.
A3D does not require large computational overhead.
Abstract
Modeling 3D context is essential for high-performance 3D medical image analysis. Although 2D networks benefit from large-scale 2D supervised pretraining, it is weak in capturing 3D context. 3D networks are strong in 3D context yet lack supervised pretraining. As an emerging technique, \emph{3D context fusion operator}, which enables conversion from 2D pretrained networks, leverages the advantages of both and has achieved great success. Existing 3D context fusion operators are designed to be spatially symmetric, i.e., performing identical operations on each 2D slice like convolutions. However, these operators are not truly equivariant to translation, especially when only a few 3D slices are used as inputs. In this paper, we propose a novel asymmetric 3D context fusion operator (A3D), which uses different weights to fuse 3D context from different 2D slices. Notably, A3D is NOT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
