Deep learning models for representing out-of-vocabulary words

Johannes V. Lochter; Renato M. Silva; Tiago A. Almeida

arXiv:2007.07318·cs.CL·May 30, 2024

Deep learning models for representing out-of-vocabulary words

Johannes V. Lochter, Renato M. Silva, Tiago A. Almeida

PDF

TL;DR

This paper evaluates deep learning models for representing out-of-vocabulary words in NLP, highlighting that different tasks require different techniques, with Comick showing promising results by leveraging context and morphology.

Contribution

It provides a comprehensive performance evaluation of various deep learning methods for OOV words across multiple NLP tasks, introducing Comick as an effective approach.

Findings

01

Different OOV handling techniques perform best for different NLP tasks.

02

Comick, which uses context and morphology, yields promising results.

03

The choice of OOV method depends on the specific NLP application.

Abstract

Communication has become increasingly dynamic with the popularization of social networks and applications that allow people to express themselves and communicate instantly. In this scenario, distributed representation models have their quality impacted by new words that appear frequently or that are derived from spelling errors. These words that are unknown by the models, known as out-of-vocabulary (OOV) words, need to be properly handled to not degrade the quality of the natural language processing (NLP) applications, which depend on the appropriate vector representation of the texts. To better understand this problem and finding the best techniques to handle OOV words, in this study, we present a comprehensive performance evaluation of deep learning models for representing OOV words. We performed an intrinsic evaluation using a benchmark dataset and an extrinsic evaluation using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.