Personalizing Universal Recurrent Neural Network Language Model with   User Characteristic Features by Social Network Crowdsouring

Bo-Hsiang Tseng; Hung-Yi Lee; and Lin-Shan Lee

arXiv:1506.01192·cs.CL·November 23, 2016

Personalizing Universal Recurrent Neural Network Language Model with User Characteristic Features by Social Network Crowdsouring

Bo-Hsiang Tseng, Hung-Yi Lee, and Lin-Shan Lee

PDF

TL;DR

This paper introduces a universal RNN language model personalized with user features derived from social network data, significantly improving speech recognition accuracy and reducing data sparseness.

Contribution

It proposes a novel approach to personalize language models using user characteristic features obtained via crowdsourcing from social networks, enabling shared models with individual differences.

Findings

01

Reduced model perplexity in experiments

02

Improved recognition accuracy in rescoring tests

03

Mitigated data sparseness problem for personalization

Abstract

With the popularity of mobile devices, personalized speech recognizer becomes more realizable today and highly attractive. Each mobile device is primarily used by a single user, so it's possible to have a personalized recognizer well matching to the characteristics of individual user. Although acoustic model personalization has been investigated for decades, much less work have been reported on personalizing language model, probably because of the difficulties in collecting enough personalized corpora. Previous work used the corpora collected from social networks to solve the problem, but constructing a personalized model for each user is troublesome. In this paper, we propose a universal recurrent neural network language model with user characteristic features, so all users share the same model, except each with different user characteristic features. These user characteristic features…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.