Infant Vocal Tract Development Analysis and Diagnosis by Cry Signals with CNN Age Classification
Chunyan Ji, Yi Pan

TL;DR
This paper presents a CNN-based method for early diagnosis of infant vocal tract development abnormalities using cry signals, achieving high accuracy in classifying healthy and pathological cries at 4 months old.
Contribution
The study introduces a novel CNN approach for age classification from cry signals, enabling early detection of vocal tract abnormalities in infants.
Findings
Achieved 79.20% accuracy for healthy cries
Achieved 84.80% accuracy for asphyxiated cries
Achieved 91.20% accuracy for deaf cries
Abstract
From crying to babbling and then to speech, infant's vocal tract goes through anatomic restructuring. In this paper, we propose a non-invasive fast method of using infant cry signals with convolutional neural network (CNN) based age classification to diagnose the abnormality of the vocal tract development as early as 4-month age. We study F0, F1, F2, and spectrograms and relate them to the postnatal development of infant vocalization. A novel CNN based age classification is performed with binary age pairs to discover the pattern and tendency of the vocal tract changes. The effectiveness of this approach is evaluated on Baby2020 with healthy infant cries and Baby Chillanto database with pathological infant cries. The results show that our approach yields 79.20% accuracy for healthy cries, 84.80% for asphyxiated cries, and 91.20% for deaf cries. Our method first reveals that infants'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfant Health and Development · Voice and Speech Disorders · Speech Recognition and Synthesis
