# Enriching Rare Word Representations in Neural Language Models by   Embedding Matrix Augmentation

**Authors:** Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong, Chng

arXiv: 1904.03799 · 2021-01-14

## TL;DR

This paper introduces a method to improve neural language models by enriching rare word representations through embedding matrix augmentation, leading to better probability estimates and reduced error rates.

## Contribution

It proposes a novel approach to enhance rare word embeddings in pre-trained NLMs by leveraging similar words, improving speech recognition performance.

## Key findings

- Reduced word error rate by 6% relative
- Improved rare word recognition accuracy by 16%
- Effective augmentation of rare word embeddings

## Abstract

The neural language models (NLM) achieve strong generalization capability by learning the dense representation of words and using them to estimate probability distribution function. However, learning the representation of rare words is a challenging problem causing the NLM to produce unreliable probability estimates. To address this problem, we propose a method to enrich representations of rare words in pre-trained NLM and consequently improve its probability estimation performance. The proposed method augments the word embedding matrices of pre-trained NLM while keeping other parameters unchanged. Specifically, our method updates the embedding vectors of rare words using embedding vectors of other semantically and syntactically similar words. To evaluate the proposed method, we enrich the rare street names in the pre-trained NLM and use it to rescore 100-best hypotheses output from the Singapore English speech recognition system. The enriched NLM reduces the word error rate by 6% relative and improves the recognition accuracy of the rare words by 16% absolute as compared to the baseline NLM.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.03799/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1904.03799/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/1904.03799/full.md

---
Source: https://tomesphere.com/paper/1904.03799