Nonparametric Bayesian Double Articulation Analyzer for Direct Language Acquisition from Continuous Speech Signals
Tadahiro Taniguchi, Ryo Nakashima, and Shogo Nagasaka

TL;DR
This paper introduces a nonparametric Bayesian model that automatically discovers hierarchical language structures like words and phonemes directly from continuous speech without labeled data, mimicking infant language acquisition.
Contribution
The paper presents the HDP-HLM model and the NPB-DAA method, enabling unsupervised learning of language and acoustic models from raw speech signals, which is a novel approach in speech analysis.
Findings
Outperformed conventional double articulation analyzers in word and phoneme tasks
Successfully analyzed synthetic and real Japanese speech data
Automatically estimated hierarchical language structures without supervision
Abstract
Human infants can discover words directly from unsegmented speech signals without any explicitly labeled data. In this paper, we develop a novel machine learning method called nonparametric Bayesian double articulation analyzer (NPB-DAA) that can directly acquire language and acoustic models from observed continuous speech signals. For this purpose, we propose an integrative generative model that combines a language model and an acoustic model into a single generative model called the "hierarchical Dirichlet process hidden language model" (HDP-HLM). The HDP-HLM is obtained by extending the hierarchical Dirichlet process hidden semi-Markov model (HDP-HSMM) proposed by Johnson et al. An inference procedure for the HDP-HLM is derived using the blocked Gibbs sampler originally proposed for the HDP-HSMM. This procedure enables the simultaneous and direct inference of language and acoustic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
