Multi-Syllable Phonotactic Modelling

Anja Belz (CCSRC; SRI International)

arXiv:cs/0102020·cs.CL·May 23, 2007·3 cites

Multi-Syllable Phonotactic Modelling

Anja Belz (CCSRC, SRI International)

PDF

Open Access

TL;DR

This paper introduces a new finite-state formalism called OFS Modelling for automatically constructing detailed multi-syllable phonotactic models across languages, capturing language-specific phonological patterns.

Contribution

It presents a novel multisyllable approach and a formalism that enables automatic, data-driven phonotactic modeling for multiple languages.

Findings

01

Successfully modeled German, English, and Dutch phonotactics.

02

Achieved close approximations of language-specific phonological forms.

03

Demonstrated language-independent prototype models.

Abstract

This paper describes a novel approach to constructing phonotactic models. The underlying theoretical approach to phonological description is the multisyllable approach in which multiple syllable classes are defined that reflect phonotactically idiosyncratic syllable subcategories. A new finite-state formalism, OFS Modelling, is used as a tool for encoding, automatically constructing and generalising phonotactic descriptions. Language-independent prototype models are constructed which are instantiated on the basis of data sets of phonological strings, and generalised with a clustering algorithm. The resulting approach enables the automatic construction of phonotactic models that encode arbitrarily close approximations of a language's set of attested phonological forms. The approach is applied to the construction of multi-syllable word-level phonotactic models for German, English and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling