An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production
Anwesha Roy, Varun Belagali, Prasanta Kumar Ghosh

TL;DR
This paper introduces an error detection and correction scheme for improved segmentation of air-tissue boundaries in real-time MRI videos of speech production, addressing limitations of global evaluation metrics.
Contribution
It proposes a novel error correction method and two new localized evaluation metrics for better assessment of segmentation accuracy in specific contour regions.
Findings
Significant improvement in localized evaluation metrics (over 60%)
Enhanced segmentation accuracy in critical regions like velum and tongue base
Traditional DTW metric shows less improvement compared to proposed metrics
Abstract
The best performance in Air-tissue boundary (ATB) segmentation of real-time Magnetic Resonance Imaging (rtMRI) videos in speech production is known to be achieved by a 3-dimensional convolutional neural network (3D-CNN) model. However, the evaluation of this model, as well as other ATB segmentation techniques reported in the literature, is done using Dynamic Time Warping (DTW) distance between the entire original and predicted contours. Such an evaluation measure may not capture local errors in the predicted contour. Careful analysis of predicted contours reveals errors in regions like the velum part of contour1 (ATB comprising of upper lip, hard palate, and velum) and tongue base section of contour2 (ATB covering jawline, lower lip, tongue base, and epiglottis), which are not captured in a global evaluation metric like DTW distance. In this work, we automatically detect such errors and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraditional Chinese Medicine Studies · Speech and Audio Processing · Advanced Computing and Algorithms
MethodsBalanced Selection · Dynamic Time Warping
