TL;DR
AnatomyNet is a deep learning model that rapidly and fully automates the segmentation of multiple head and neck anatomical structures from whole-volume CT images, improving accuracy and speed over previous methods.
Contribution
The paper introduces AnatomyNet, a novel 3D U-net based architecture with enhancements for whole-volume segmentation and handling missing annotations, advancing automated head and neck anatomy segmentation.
Findings
Achieved 3.3% higher Dice scores than previous state-of-the-art.
Segmented entire head and neck in about 0.12 seconds per image.
Successfully processed whole-volume CT images with minimal pre- and post-processing.
Abstract
Methods: Our deep learning model, called AnatomyNet, segments OARs from head and neck CT images in an end-to-end fashion, receiving whole-volume HaN CT images as input and generating masks of all OARs of interest in one shot. AnatomyNet is built upon the popular 3D U-net architecture, but extends it in three important ways: 1) a new encoding scheme to allow auto-segmentation on whole-volume CT images instead of local patches or subsets of slices, 2) incorporating 3D squeeze-and-excitation residual blocks in encoding layers for better feature representation, and 3) a new loss function combining Dice scores and focal loss to facilitate the training of the neural model. These features are designed to address two main challenges in deep-learning-based HaN segmentation: a) segmenting small anatomies (i.e., optic chiasm and optic nerves) occupying only a few slices, and b) training with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConcatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Convolution · U-Net · Focal Loss
