Automatic size and pose homogenization with spatial transformer network   to improve and accelerate pediatric segmentation

Giammarco La Barbera; Pietro Gori; Haithem Boussaid; Bruno; Belucci; Alessandro Delmonte; Jeanne Goulin; Sabine Sarnacki and; Laurence Rouet; Isabelle Bloch

arXiv:2107.02655·cs.CV·July 7, 2021

Automatic size and pose homogenization with spatial transformer network to improve and accelerate pediatric segmentation

Giammarco La Barbera, Pietro Gori, Haithem Boussaid, Bruno, Belucci, Alessandro Delmonte, Jeanne Goulin, Sabine Sarnacki and, Laurence Rouet, Isabelle Bloch

PDF

Open Access

TL;DR

This paper introduces a CNN with Spatial Transformer Networks for pediatric image segmentation that normalizes pose and size, leading to faster and more accurate segmentation results.

Contribution

The novel architecture integrates pose and scale normalization into the segmentation process, reducing training time and improving segmentation accuracy in pediatric CT images.

Findings

01

Homogenization accelerates segmentation by 25 hours versus data augmentation.

02

Achieves 88.01% Dice score for kidney segmentation.

03

Improves renal tumor delineation from 85.52% to 87.12%.

Abstract

Due to a high heterogeneity in pose and size and to a limited number of available data, segmentation of pediatric images is challenging for deep learning methods. In this work, we propose a new CNN architecture that is pose and scale invariant thanks to the use of Spatial Transformer Network (STN). Our architecture is composed of three sequential modules that are estimated together during training: (i) a regression module to estimate a similarity matrix to normalize the input image to a reference one; (ii) a differentiable module to find the region of interest to segment; (iii) a segmentation module, based on the popular UNet architecture, to delineate the object. Unlike the original UNet, which strives to learn a complex mapping, including pose and scale variations, from a finite training dataset, our segmentation module learns a simpler mapping focusing on images with normalized pose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Renal and related cancers · Pancreatic and Hepatic Oncology Research

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Dropout · Layer Normalization · Byte Pair Encoding · Label Smoothing