A statistical shape space model of the palate surface trained on 3D MRI   scans of the vocal tract

Alexander Hewer (DFKI; MMCI); Ingmar Steiner (DFKI; MMCI); Timo; Bolkart (MMCI); Stefanie Wuhrer (MORPHEO); Korin Richmond (CSTR)

arXiv:1602.07679·cs.CV·February 25, 2016

A statistical shape space model of the palate surface trained on 3D MRI scans of the vocal tract

Alexander Hewer (DFKI, MMCI), Ingmar Steiner (DFKI, MMCI), Timo, Bolkart (MMCI), Stefanie Wuhrer (MORPHEO), Korin Richmond (CSTR)

PDF

Open Access

TL;DR

This paper introduces a minimally-supervised statistical shape model of the palate surface derived from 3D MRI scans, demonstrating low error and broad applicability to various imaging modalities.

Contribution

A novel PCA-based method for creating a statistical palate shape model from MRI data with minimal supervision and cross-modality applicability.

Findings

01

Low reconstruction error with limited measured coordinates

02

Model generalizes well to new MRI and optical scan data

03

Applicable to other modalities like EMA and UTI

Abstract

We describe a minimally-supervised method for computing a statistical shape space model of the palate surface. The model is created from a corpus of volumetric magnetic resonance imaging (MRI) scans collected from 12 speakers. We extract a 3D mesh of the palate from each speaker, then train the model using principal component analysis (PCA). The palate model is then tested using 3D MRI from another corpus and evaluated using a high-resolution optical scan. We find that the error is low even when only a handful of measured coordinates are available. In both cases, our approach yields promising results. It can be applied to extract the palate shape from MRI data, and could be useful to other analysis modalities, such as electromagnetic articulography (EMA) and ultrasound tongue imaging (UTI).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPhonetics and Phonology Research · Speech Recognition and Synthesis · Bayesian Methods and Mixture Models