Personalizing Universal Recurrent Neural Network Language Model with User Characteristic Features by Social Network Crowdsouring
Bo-Hsiang Tseng, Hung-Yi Lee, and Lin-Shan Lee

TL;DR
This paper introduces a universal RNN language model personalized with user features derived from social network data, significantly improving speech recognition accuracy and reducing data sparseness.
Contribution
It proposes a novel approach to personalize language models using user characteristic features obtained via crowdsourcing from social networks, enabling shared models with individual differences.
Findings
Reduced model perplexity in experiments
Improved recognition accuracy in rescoring tests
Mitigated data sparseness problem for personalization
Abstract
With the popularity of mobile devices, personalized speech recognizer becomes more realizable today and highly attractive. Each mobile device is primarily used by a single user, so it's possible to have a personalized recognizer well matching to the characteristics of individual user. Although acoustic model personalization has been investigated for decades, much less work have been reported on personalizing language model, probably because of the difficulties in collecting enough personalized corpora. Previous work used the corpora collected from social networks to solve the problem, but constructing a personalized model for each user is troublesome. In this paper, we propose a universal recurrent neural network language model with user characteristic features, so all users share the same model, except each with different user characteristic features. These user characteristic features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
