A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic Contours
Branislav Gerazov, G\'erard Bailly, Yi Xu

TL;DR
This paper introduces a weighted superposition model for prosody that captures the prominence of elementary contours based on context, improving the modeling of linguistic and paralinguistic information in speech.
Contribution
It extends the SFC model by incorporating weights for contours based on context, enabling better modeling of prominence and attitude effects in speech prosody.
Findings
WSFC effectively captures contour prominence.
Improves prosody modeling performance.
Models attitude and emphasis effects in speech.
Abstract
The way speech prosody encodes linguistic, paralinguistic and non-linguistic information via multiparametric representations of the speech signals is still an open issue. The Superposition of Functional Contours (SFC) model proposes to decompose prosody into elementary multiparametric functional contours through the iterative training of neural network contour generators using analysis-by-synthesis. Each generator is responsible for computing multiparametric contours that encode one given linguistic, paralinguistic and non-linguistic information on a variable scope of rhythmic units. The contributions of all generators' outputs are then overlapped and added to produce the prosody of the utterance. We propose an extension of the contour generators that allows them to model the prominence of the elementary contours based on contextual information. WSFC jointly learns the patterns of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
